Gene Haur_2087 details


Gene Information

Locus tag: Haur_2087
Symbol:
ID: 5733975
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2598983
End bp: 2600722
Gene Length: 1740 bp
Protein Length: 579 aa
Translation table: 11
GC content: 51%
IMG OID: 641279228
Product: ABC transporter related
Protein accession: YP_001544855
Protein GI: 159898608
COG category: [V] Defense mechanisms
COG ID: [COG1132] ABC-type multidrug transport system, ATPase and permease components
TIGRFAM ID:
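
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes a single terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2598983
end_bp = 2600722
gene_length_bp = 1740
protein_length_aa = 579

# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted
# in the reported protein length.
total_codons = span_bp // 3
coding_codons = total_codons - 1

print("span (bp):", span_bp)
print("coding codons:", coding_codons)
print("matches gene length:", span_bp == gene_length_bp)
print("matches protein length:", coding_codons == protein_length_aa)
```

Under these assumptions the span, the reported gene length, and the reported protein length agree with one another.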


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGTTTC CGCTAGCGCG TTATCGTCGT TTACTGGCAA CCTATCTCAA GCCACAGCGT 
GGGCGGGTGC TCGGCTTAGG ATGCTTGGTT TTTCTGGGCA TTGGCTTGCA ATTGCTTAAT
CCGCAAATTA TGCGCCGATT TTTAGATTCG GCGCTCGCTG CTCAACCTTT GGAGCAACTG
ACCAAACTGG CATTACTCTT TATGGGTATC GCCTTTGGAC AACAACTTTT AGCGCTCGGT
GCGGTTTATC TGAGCGAGCA AATTGGCTGG ACCGCAACGA ATGCCTTACG TGCCGATTTG
ACCGCCCATT GTCTGCACTT GGATAGTAGT TTTCACAATC ACAAAACACC AGGCGAGCTA
ATCGAGCGCA TCGACGGCGA TGTTACGACC TTAGCCAACT TTTTTACCCA GTTTATTATT
CAATTGCTGG GCAATGGCTT GTTGCTGCTG GGGATTATTG TGGCCTTGGC CTTGGAAGAT
TGGCGGGTTG CAGTTGGGCT AATCGTCTGT GTGGCAGCGG CGATCAGCCT GTTGCAAAAA
ATGCAACGGG TTGGCGTACC ACTCTGGGGC GAATCACGCC AAGCGAGTGC TGAGCTGTTT
GGCTTTTTAG AAGAACGCTT GGCGGCAACC GAAGATATTC GCTCAAGCGG CGCACGCAAT
TATGTGATGC AACAGCTCTA TCGCCAAATG CTCGTGCTCT ATCGCAAAAC TCGTAAAGCT
GAGCTAGCCA CCGCTTGGCT CTATAACGCT GGCCAATCGA TGTTTATTAT CGCCAGTGGT
TTGGGTTTGG GCATTGGTAT CTACCTGTAT CAACAGCAAC AAACGACAAT TGGTGGCGTG
TATATCATCA CAGCCTATAT TGGCTTGCTC ACCACGCCGC TCGAGCAAAT TTTACGCCAA
ATTCAGGAGT TTCAAAAAGC CAGTGCCGGC ATTTTGCGCA TCGATGAATT GCGCCAGCAG
CGACCAGCAA TCGTTGATGG CACTGGCCCG ACAATCCCGC AACAAGCACT TGATCTGCGG
TTTGAGCAAG TTTCATTTGG CTATAACACC GATACGCCAG TCTTGCGAGC GCTTTCGTTT
GAATTACCCG CCCACCAGAT TTTAGGCGTA TTGGGACGGA CTGGTAGCGG CAAAACGAGC
CTGATTCGGC TTTTGTTACG CTTGTATGAT CCGCATCAAG GTACAATTCG CTTAGGCGGC
ATCGATATTC GCAACACCAG TTTAGCCGAT TTGCGCCGCT CGATTGGCTT CGTGAGCCAA
GATGTGCAAC TATTTCATGC CAGCGTTCGC GATAACCTGA GCTTTTTCGA TCACAGAATT
GCTGATCAGC AATTGCTGGC TGCCTTAGAA ACCCTCGGCC TCACCAACTG GCTTAACAGC
CTTGAACATG GCCTCGATAC GCTGATCAAG CCAAATGGGC TTTCGGCGGG TCAGGCTCAG
CTCTTGGCAT TTGCGCGAAT TTTGCTAAAA GACCCACGAT TAATTATCCT CGATGAGGCT
TCATCACGGC TTGACCCAGC GAGTGAGGCG ATTGTCGAAC GCGCCTTAGA TCGGCTGTTG
GCAGGTCGCA CCGCAATTAT CATTGCCCAT CGGTTGGCAA CCTTACAACG CGCCGATGCG
ATTTTAGTGC TCGATGCTGG CACAATTCGT GAGTTTGGCC CACGCCAAGC ACTGTTACAA
AACCCTCAAT CGCAGTATAG CCAACTGTTG CAACATGGCA TAACCGAGGT GTTCGCATGA
 
Protein sequence
MQFPLARYRR LLATYLKPQR GRVLGLGCLV FLGIGLQLLN PQIMRRFLDS ALAAQPLEQL 
TKLALLFMGI AFGQQLLALG AVYLSEQIGW TATNALRADL TAHCLHLDSS FHNHKTPGEL
IERIDGDVTT LANFFTQFII QLLGNGLLLL GIIVALALED WRVAVGLIVC VAAAISLLQK
MQRVGVPLWG ESRQASAELF GFLEERLAAT EDIRSSGARN YVMQQLYRQM LVLYRKTRKA
ELATAWLYNA GQSMFIIASG LGLGIGIYLY QQQQTTIGGV YIITAYIGLL TTPLEQILRQ
IQEFQKASAG ILRIDELRQQ RPAIVDGTGP TIPQQALDLR FEQVSFGYNT DTPVLRALSF
ELPAHQILGV LGRTGSGKTS LIRLLLRLYD PHQGTIRLGG IDIRNTSLAD LRRSIGFVSQ
DVQLFHASVR DNLSFFDHRI ADQQLLAALE TLGLTNWLNS LEHGLDTLIK PNGLSAGQAQ
LLAFARILLK DPRLIILDEA SSRLDPASEA IVERALDRLL AGRTAIIIAH RLATLQRADA
ILVLDAGTIR EFGPRQALLQ NPQSQYSQLL QHGITEVFA
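
The relationship between the gene sequence and the protein sequence can be illustrated by translating a few rows of the nucleotide listing. This is a minimal sketch, not the database's own pipeline: the `translate` helper and the constant names are hypothetical, and the codon table is the standard set of codon assignments, which translation table 11 shares for elongation (table 11 differs in its permitted start codons, which is not relevant here because the gene begins with ATG).

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: these are sufficient for translation table 11, which
# uses the same amino acid assignments and differs only in start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the listing) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    # Walk complete codons only; any trailing partial codon is ignored.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence listing above (60 bases each,
# so both start on a codon boundary).
first_row = "ATGCAGTTTC CGCTAGCGCG TTATCGTCGT TTACTGGCAA CCTATCTCAA GCCACAGCGT"
last_row = "AACCCTCAAT CGCAGTATAG CCAACTGTTG CAACATGGCA TAACCGAGGT GTTCGCATGA"

print(translate(first_row))
print(translate(last_row))
```

The two printed strings correspond to the first 20 and the last 19 residues of the protein sequence; the final TGA codon is the stop and contributes no residue.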