Gene Haur_3180 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3180 
Symbol 
ID5735055 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4020308 
End bp4022248 
Gene Length1941 bp 
Protein Length646 aa 
Translation table11 
GC content53% 
IMG OID641280326 
Productradical SAM domain-containing protein 
Protein accessionYP_001545945 
Protein GI159899698 
COG category[C] Energy production and conversion 
COG ID[COG1032] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATTTCA TGCTAAAGAT CGTGCTACTC CAACTTCCAG TGCCCAGCAA TCCGGCGGCG 
AATGTGCCGC TGGCAGCGGG CTACCTCAAA GCATGGGCGT ACAACCAAGG CTTACTTGAG
CGTATGCAAA TTGAGATTGT GCCACGTGAT ATTGCCGACC GTGGTGGTGA TCGCCTGATT
TGTAATTGGA TCATCGCTCA ACAACCACAC GTTCTTGGGA TTTCGCTGTA TACCTGGAAT
AGCGAACGTT CGTTGCATCT TGCTAGTCAA CTCAAACAAG CATTACCAAA TTTGATTGTG
GTGGTTGGCG GCCCCGAAGT GCAGCGCAAT AACACCTGGG TTTTGCAACA TCCAGCGGTC
GATGTTGCAG TAGAAGGCGA GGGCGAACAA ACCTTCAGCG AATTACTGCT GGCGCTCGAA
CACCAGCACC AGCAAATTAG CTTGCCGATG CATAACCAAA CCCAATTGCC CTACCCGCTG
GTGGCTGGCA CATTGCAATA TCACGCGAGC CAATTGCATG CTGGCTTACC CCGCCCCGCC
ATGGGGAGCC TTGATCCAAT TCCATCGCCC TATTTGCTGG GCTTTTTGGA GCTACGGGCT
GGCGAAATTG CCTTTATCGA ATGTTCGCGC TGGTGTCCTT ACGGCTGTAC CTTCTGCTTG
TATGGCCGCA ACATGGGCAC AAAATTGGGT GGTCGCATGT TTGGCAGCCA ACGGGTGTTG
GATGAAGTGG CCTGGGCACG CCAGCAGGGC GCACGGGCAA TTCATTTTGT TGAGGCGAAT
CTCAATTTGC TGCCCAATTT TCGCGAATTG ATGCGCGGCC TGCAAAGCAT CAATCAGCCT
GAACCAACCC CAATTTATGC CGAATTACGT GGCGAACACC TTAAGCCAGA AAGTGTTGAG
GCCTTGGTGC AAGCAGGCTT GACTGTGGCC GAAGTTGGCT TGCAAAGCGC CAATCGCACA
GCCTTGCAAG CCGTTGGGCG GCGTACCGAC CTCGAAAAAT GGGCCGAAGG CACGCGCCGT
CTGTATCAGC ATGATGTAGC GGTGTTGCTC GATGTGATTT TGGGCTTGCC CGAAGATGAT
GCTGACAGCA CCCATGCTAC GATCGAATGG ATTCAGCAGC AGCAACTTGG CGATTACGAT
ATTTTTACCT TGCAAGTATT ACCTGGCACA GGCGTGCGCA ACGATGCCGA ACGCTTTGGC
ATGCACTTTC AAGATCGCCC ACCCTACTAC ATTTTGGCCA ATCATTGGCT CAACTACCAA
CAACTTCGCG CCCTACGCTG GGATTTACGT GAACAAGCCG GGCTTGATCC CTTGGCAATC
GAAGGCATGC CGCAACCAGC TTGCGATGTT TGGCTCCAAG CCTGCGAGCA AGCAATTCGC
GTGATCGATC AGCCAATTCG TACTATCGTT TTGGATTGTC GGGCTGAATT CAGTTTAGAC
GAATGGCGGG AGCAAGGCCA ACATTATGCT GATCAGGTTG CCAGCCATGT GGTGGTTATC
GCGCATCATA GTGATTTGGC CGTAATTGAG GCCTTTTGCT GGCCCATCGC CCAAGCCAAC
CTCACGATTC ACTGGGATGT TGTGCTCGAT CAGCCGATCG CCCCCAACGC ATTACGTCAA
CTCCAACAAC GCTGGCCGCA TACCATTGGC TACCTCGATC GCATAGCGGT CTATCGGCGC
TGGCAAGCCG ATCCAGCGTG GGTGCAAGTC ACACCGCGCT GGTGGATTCG CTGCGATTGG
CAACAAGCGC TTGATCCATT GAGCTACGAA GGCATTGCCG AGGTCGTTTG GCAGGTCGCA
GCCGATCAAG CTAGTGTGGC GATTCCGGTT CTGAATCGCC GGGGTGGCAC GGGGATTGTG
ATTGAAGCTG AGCAATTTGA GCCAACATGG CAGGAACAAA GTGAGCATTT AGCAATTTTG
GCTCCCCTGC CACACGATTA A
 
Protein sequence
MYFMLKIVLL QLPVPSNPAA NVPLAAGYLK AWAYNQGLLE RMQIEIVPRD IADRGGDRLI 
CNWIIAQQPH VLGISLYTWN SERSLHLASQ LKQALPNLIV VVGGPEVQRN NTWVLQHPAV
DVAVEGEGEQ TFSELLLALE HQHQQISLPM HNQTQLPYPL VAGTLQYHAS QLHAGLPRPA
MGSLDPIPSP YLLGFLELRA GEIAFIECSR WCPYGCTFCL YGRNMGTKLG GRMFGSQRVL
DEVAWARQQG ARAIHFVEAN LNLLPNFREL MRGLQSINQP EPTPIYAELR GEHLKPESVE
ALVQAGLTVA EVGLQSANRT ALQAVGRRTD LEKWAEGTRR LYQHDVAVLL DVILGLPEDD
ADSTHATIEW IQQQQLGDYD IFTLQVLPGT GVRNDAERFG MHFQDRPPYY ILANHWLNYQ
QLRALRWDLR EQAGLDPLAI EGMPQPACDV WLQACEQAIR VIDQPIRTIV LDCRAEFSLD
EWREQGQHYA DQVASHVVVI AHHSDLAVIE AFCWPIAQAN LTIHWDVVLD QPIAPNALRQ
LQQRWPHTIG YLDRIAVYRR WQADPAWVQV TPRWWIRCDW QQALDPLSYE GIAEVVWQVA
ADQASVAIPV LNRRGGTGIV IEAEQFEPTW QEQSEHLAIL APLPHD