Gene Haur_0321 details


Gene Information

Locus tag: Haur_0321
Symbol:
ID: 5732231
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 381544
End bp: 382752
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 52%
IMG OID: 641277445
Product: major facilitator transporter
Protein accession: YP_001543101
Protein GI: 159896854
COG category:
COG ID:
TIGRFAM ID:
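
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


# Values taken from the Gene Information fields above
length = gene_length(381544, 382752)
print(length)                  # 1209
print(protein_length(length))  # 402
```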


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCAACCGC AAGCGCGTTT TAGCTATTTG GTTTTATCGT TTGGTACTCG ACTGCTCTTA 
AGTTGTATTT TTACCGCCTC AATGCTCTAT CGAATTCAAG TGCTCCAGCT TGATCCATTG
CAATTGGTTT TAGTTGGCAC AATGCTTGAG GCTGCCGCCT TCGCCTTAGA AATTCCCACG
GGTGTGGTTG CCGATGTCTA TAGTCGGCGA TTGTCGGTCG TGTTGGGAGT AGCCTTTTTG
GGCTTGGGAG CGCTCAGCGA GGTTTGGCTG GGCAGTTTTG TTGGCAGTTT GTTAGCGCAA
GTTGTTTGGG CCTTGGGTTA TACCCTCATG AGCGGCGCGA CCGAAGCTTG GATCACCGAT
GAACTCGGAG TTGAAGTGGT TGAAGCGTTG TTTCTCAAGG CCGCCCAACT CAACAGCATC
GCCAGTTTGC TTGGCATCGG GCTGGGGGTG GCGTTGGCAA CTATTGGCTT GGTTTGGCCA
ATCGTTGGTG GTGGCCTCGC TTTGGTTGGT TTAGCGATCA GCATTCGCTG GTTTATGCCA
GAAACCCAGT TTAGCCCAGC GCCAGCTGCT GAACGCCAAA GTTGGGGCCG CTTCGGCGCA
ACCCTCGGCC ATGGTTGGCA AGCGGTACGA GGACAACCAG TATTATTGAG TATTATGGTG
ATGAGTGCCA TCGCCGGAGC CGCTTCTGAA GGCTACGATC GGCTGTGGGA AGCGCACTTG
CTCAACAATA TTGGCCTACC AACATGGCTT GATCTCCAGC CAGTTAGCTG GTTTGGTGCA
ATCGCCGCAC TGGGAACCTT GGCTAGTTTG GTGGCAGTTA TCGTGATCAA ACGCTTAGCC
AGCAACACCA CCGAGCGCAG TATTTGGCTG TTACGCTGGC AATATGGCTT GTTGGCAGCA
GGCTTGCTTG GTTTAGCGCT CACCCAACAA TTTGCTATAG CCTTATTTTG GATTATGCTC
ATTCGCATGC TACGTCAAAG CATTCAGCCG CTGTATAGTG CTTGGCTCAA TCGGCTGATT
GAAAGTCGCA GCCGTGCGAC AATTATTTCG ATCAATAGCC AAGCTGATGC CTTTGGCCAG
ATTCTGGGCG GCCCGATCAT TGGCTTAATC GCTAGTCGCA TGGGACTCCC AGCGGCGTTT
GTGGCAGCAA GTTGCTGTTT ATGGCCAATG CTGGTGTTAT TACGGCAGAG GCAACTAAAC
CGCCAATAA
 
Protein sequence
MQPQARFSYL VLSFGTRLLL SCIFTASMLY RIQVLQLDPL QLVLVGTMLE AAAFALEIPT 
GVVADVYSRR LSVVLGVAFL GLGALSEVWL GSFVGSLLAQ VVWALGYTLM SGATEAWITD
ELGVEVVEAL FLKAAQLNSI ASLLGIGLGV ALATIGLVWP IVGGGLALVG LAISIRWFMP
ETQFSPAPAA ERQSWGRFGA TLGHGWQAVR GQPVLLSIMV MSAIAGAASE GYDRLWEAHL
LNNIGLPTWL DLQPVSWFGA IAALGTLASL VAVIVIKRLA SNTTERSIWL LRWQYGLLAA
GLLGLALTQQ FAIALFWIML IRMLRQSIQP LYSAWLNRLI ESRSRATIIS INSQADAFGQ
ILGGPIIGLI ASRMGLPAAF VAASCCLWPM LVLLRQRQLN RQ
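
The relationship between the gene sequence, the GC content, and the protein sequence can be explored with a minimal sketch such as the one below. It assumes the sequences are pasted with the 10-base spacing shown above, that the first codon is a valid initiator (GTG here) and is therefore read as methionine, and that translation table 11 uses the standard codon assignments for all other positions. The helper names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); translation
# table 11 shares these and differs only in which codons may start a CDS.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a complete CDS, reading the first codon as Met (assumption)."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    protein[0] = "M"          # initiator codon, e.g. GTG, is read as Met
    if protein[-1] == "*":    # drop the terminal stop codon if present
        protein.pop()
    return "".join(protein)


# First 30 bases of the gene sequence listed above
print(translate_cds("GTGCAACCGC AAGCGCGTTT TAGCTATTTG"))  # MQPQARFSYL
```

Passing the full gene sequence to `translate_cds` and `gc_percent` would allow a comparison with the protein sequence and the GC content reported in this record.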