Gene Haur_3800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3800 
Symbol 
ID5735664 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4770873 
End bp4771904 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content54% 
IMG OID641280952 
Productinner-membrane translocator 
Protein accessionYP_001546564 
Protein GI159900317 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4177] ABC-type branched-chain amino acid transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCGTT TGCGCCCACT ACTGCCACTT GTGGCGATTG GCGGCCTTGC AAGCTTGCCA 
TTTGCCTTTG CCGCCGATAC CACCCGCTTT TGGCAAGGGG TTTTCATTCA GATTTTTATT
CTGGCGATTT ATGCCCTGAG CTACGATTTG CTGATGGGCT ACACCGGCAT GATTTCGTTT
GGCCATGCGC TGTTTTTTGG CGCTGGTGGC TATACCGCCG CCATTTTGCT GCGCCAAGCC
GAGCCGCCCT CGATGCTAAT TGTGATTGCA GCGGTGATTG TGGTTGGCGC AACTATCGGG
GCAGCAGTGG GCGCATTGTC GCTACGGGTC AGCGGGGTCT ATTTTTCGAT GATTACCCTC
GCGCTGGCCG AAATTGCCTT CATCCTGTTC AAAGCCGATG ATCCCAAGCT CAAGCCAATT
ACGGGCGGCG AAATTGGCTT GCAAGGGATT GTCGTGCCCG CAGCGATCGA TGCCACGACC
TATCGACTGC GCTTTTACTT TTTGACCTTG GCGTGTATGG TTGGCTTGTA TTACGCGGCG
CGGCGTTTGA TCAACTCGCC AACAGGGCGG GTATTTGTGG CAATTCGCGA GAACGAGCCA
CGCGCCACGG CCTTGGGCTA TAACACGCTG CATTTCAAAT TATTGGCAAC TGCAATTTCA
TCAACGATTG CGGCCTTGGC AGGGATGTTG ATGGTGCTGT ATGAAAAAAG TGCCAGCTTC
GAAATGCTCA GCGTTAATCT CACCATTCAA GCCTTGTTGA TGACGATCAT CGGCGGGATT
GGCACGCTAA TCGGGCCAAT GCTCGGTGCT GCTACCATCC GCTTGCTTGA TCATGGCCTC
AAAGGCGAAT TTATGCAGGC ACTTTTGCCA GCGTGGTTGC GCCCCGACTT AGTATTTGGC
ATCGTGTATG TGGTGCTGGT GTTATTCTTC CCGGCTGGTT TGATGGGAGC AATTCGCAAG
TTGCGCGGCA AAACCAGCCG TAGCTCAGTC GAACGCCTAC GCCAAGCCAT GAAACCAAAG
GCTGAATCAT GA
 
Protein sequence
MQRLRPLLPL VAIGGLASLP FAFAADTTRF WQGVFIQIFI LAIYALSYDL LMGYTGMISF 
GHALFFGAGG YTAAILLRQA EPPSMLIVIA AVIVVGATIG AAVGALSLRV SGVYFSMITL
ALAEIAFILF KADDPKLKPI TGGEIGLQGI VVPAAIDATT YRLRFYFLTL ACMVGLYYAA
RRLINSPTGR VFVAIRENEP RATALGYNTL HFKLLATAIS STIAALAGML MVLYEKSASF
EMLSVNLTIQ ALLMTIIGGI GTLIGPMLGA ATIRLLDHGL KGEFMQALLP AWLRPDLVFG
IVYVVLVLFF PAGLMGAIRK LRGKTSRSSV ERLRQAMKPK AES