Gene Information |
Locus tag | Csal_3107 |
Symbol | |
ID | 4028748 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 3461834 |
End bp | 3463444 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637968321 |
Product | protein of unknown function DUF513, hemX |
Protein accession | YP_575150 |
Protein GI | 92115222 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG2959] Uncharacterized enzyme of heme biosynthesis |
TIGRFAM ID | |
|
|
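The length fields in the table above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3461834
end_bp = 3463444

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon, which does not code for a residue.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # 1611 536
```

Under these assumptions the result agrees with the listed Gene Length (1611 bp) and Protein Length (536 aa).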
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.52248 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAAAC GACAGGACCA GGATGACAAG CAAACGCCCG CCGAGGGGGC GACGTCCAGC GATGAGACGT CCGAGTCGCC GGCCACCCGT GACACGACGT CGGGCAAAGC CGATACGGCG GACCGAGTAG CCGACGCATC GCACGGTGCC TCGGACAAGG CGGCGGAGGA CGCTGCGTCG TCCAAGGCAT CGTCGAGTGA CGCGTCGACG GTCTCGACGT CGGGAGATAC GGCATCCTCG AGTGACGAGA CGCCCGCCTC CAAGACGTCT ACCTCCTCCC GCTCAGCGTC CCCCACACAG CGCGATGCCC AGGCCGCGCG TGACACGTCG ACCCAGTCGA GCACGCAGGG CAAGACGGCC TCGTCGTCCT CGAAAAGCGC CACGCGTGCT GCGTCCAAGC GCACAGGCAA GGCTGCCGGA GGGAGCTCTT CCGGGGCGGC GTCCACCGCC GGCGCCAGTG CGTCGGGCAA GCCGAGCGCG AGCAAGGCTG AGGCGAGCCA GGGCACGCCG CCGCCCGGTG GTGCGTCGAC GACTCCATCG CGAGGCGGCA ATGGCAGTGG GCGCGGCTTG GCGATCCTGG CTCTGGTGAT TGCCGTGATC GTGGCCATCG TCGTGGCGCT GGGGCTGGGG TATGGCTGGA AGCGTCTGCA GGCACAGCAG GCCGAGCTGG CCAGCGCGGG CGAGGAGAAT GCACAGACCA TCGCGACGTT GCGTGATCGG CTCGATCAGC GTGAGCAGGC CTTCGCGTCG TTGCGCGACG ATTTTGCGTC GTACCGCCAG GATGTCGACG ACAACCTGGA CAAGGTGCTG GCCGAGCTGG CTGAGGAGCA GGACGCCGAT CCTCGCGAGT GGTTGCATGC CGAGGCCGAA TACCTGCTGC GACTGGCCAA CCAGCGGCTG CGCCTGGAGC GTGACGTGCA GGGCGCCCAG GCCTTGCTCG AGGCCGCCGA CCAGCGGCTT GCCGAGGCGG ACAATCCGGC CCTGATCCCG GTCCGCCGGG CGATCCAGTC CGAGCTTGCG GCACTGGATT CGGTGCCTCG GGTGGATCAG ACGGGGCTGT ATCTGGCGCT GATGGCACAG CAGGAGCAGC TGGCGACATT GCCGCTCAAG CAGGACGTGG AAGAGATCGC GGCTCAGAAC GCCGACGACT CGCCGCTGGA AGGTGGCTGG CGCGAACAGC TCGCGCGGCT GGGCGGCGAG CTCAAGGATC TCGTCGTCGT GCGGCGTCAT GACGAGGCGC TGGAAGCCCT GATGACGCCC GAGCAGGAAT CGTATCTGCG TCAGAACGTA CGGCTGCTGC TCGAGCAGGC GCAACTGGCC CTGCTCAAGA CCGAACCCAA GCTCTATCGG GCCAGTCTCG ACAAGGCCAC GACCCTGATC GAGCGCTATT ACGATACCGA GCGCGAGGCA GTGACGACCT CCCTCGAGCG TCTCCGGTCG ATGCGCGACA AGACGATCCG TCCCGAATTG CCGGATATCA GCGAGTCTCA GCAGACGCTC AAGGATTTCA TCGAGAAGCG CTTCCAGGAC AATGGCAGTG ACGGCAACGC GACCCGGCAG GGTGCCGGCG ATGCCACGGC CCCATCGGGC GATGAAGGAG ACAGCGCATG A
|
Protein sequence | MSKRQDQDDK QTPAEGATSS DETSESPATR DTTSGKADTA DRVADASHGA SDKAAEDAAS SKASSSDAST VSTSGDTASS SDETPASKTS TSSRSASPTQ RDAQAARDTS TQSSTQGKTA SSSSKSATRA ASKRTGKAAG GSSSGAASTA GASASGKPSA SKAEASQGTP PPGGASTTPS RGGNGSGRGL AILALVIAVI VAIVVALGLG YGWKRLQAQQ AELASAGEEN AQTIATLRDR LDQREQAFAS LRDDFASYRQ DVDDNLDKVL AELAEEQDAD PREWLHAEAE YLLRLANQRL RLERDVQGAQ ALLEAADQRL AEADNPALIP VRRAIQSELA ALDSVPRVDQ TGLYLALMAQ QEQLATLPLK QDVEEIAAQN ADDSPLEGGW REQLARLGGE LKDLVVVRRH DEALEALMTP EQESYLRQNV RLLLEQAQLA LLKTEPKLYR ASLDKATTLI ERYYDTEREA VTTSLERLRS MRDKTIRPEL PDISESQQTL KDFIEKRFQD NGSDGNATRQ GAGDATAPSG DEGDSA
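The gene sequence as listed begins with ATG and ends with TGA, so it is already in coding orientation even though the gene lies on the minus strand. The following is a minimal sketch of how the first and last blocks of the gene sequence map onto the protein sequence. It assumes the standard amino acid assignments, which translation table 11 shares with table 1; the `translate` helper and the variable names are illustrative only.

```python
from itertools import product

# Amino acid assignments in NCBI order (bases T, C, A, G; first base varies slowest).
# Assumption: table 11 uses the same assignments as the standard table.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    seq = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases, copied from the Gene sequence row.
first_block = "ATGAGCAAAC GACAGGACCA GGATGACAAG"
last_block = "GATGAAGGAG ACAGCGCATG A"

print(translate(first_block))  # MSKRQDQDDK
print(translate(last_block))   # DEGDSA
```

The two outputs match the first ten and last six residues of the protein sequence above.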