Gene Haur_1190 details


Gene Information

Locus tag: Haur_1190
Symbol:
ID: 5733083
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 1367183
End bp: 1369144
Gene Length: 1962 bp
Protein Length: 653 aa
Translation table: 11
GC content: 51%
IMG OID: 641278330
Product: amino acid permease-associated region
Protein accession: YP_001543966
Protein GI: 159897719
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0531] Amino acid transporters
TIGRFAM ID:
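
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1367183, 1369144)
print(length)                     # 1962
print(protein_length_aa(length))  # 653
```

Both results agree with the Gene Length and Protein Length fields listed in the record.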


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000537417
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATATCA AACAGCTACT CATCGGGAAG CCGTTCCCCA GCAGTAGTGC CCAGCACGAA 
CGGCTCGACA AAATTCGCGG ATTGGCAGTC TTTGCATCCG ACCCAATTAG CTCCAATGCC
TATGCCACTG AAGCGATTAT GCGGGTGCTG ATTTTGATCA GCGCCGCAGC ATTAAGTTAC
ACCTTGCCGA TTGCGATCGG GATTGCGGCC TTGGTGCTGT TGGTCGTCTT GAGCTACAAC
CAAACCATTC ACCACTACCC AATGGGTGGC GGCGCGTATA TGGTCAGTAA GGATAACCTC
GGTCGCACAG CTTCATGGTT GGCAGGTGCT TCGATTCTTA GTGATTATGT GCTGACCGTG
GCGGTTTCGG TTTCGGCGGG GGTTAAAGCA ATTTCTTCGG CCTTCCCCGA TGTCGCATTC
TTTCATGAGC ATCGGGTATT AATCGGAATT GGGATTATCC TTTTAATTAC CTGGCTGAAC
TTACGTGGTG TGCGCGAAAG CGGCACGATC TTTGCCATCC CCACCTATGC CTTTGTGTTT
GGGGTGTTCG TAGTGATCGC GATTGGCCTT GCCCGCTATT TTGGAATTTT CGGCGAGCCA
TTGCCACCAC GCAGCGTCGC AACCGACGAA ACCGCTCGTT CAGGCCTTGA TAACTTTGGC
TTGATCTGGC TGGTCTTGCG GGCCTTTGCT GGTGGTTGTA CCGCCTTGAC TGGGATTGAA
GCAATCAGCG ACGGCGTACA AGCGTTTCGT AACCCAGCGC CAAAAAATGC GATCATTACC
ATGCGAGCCA TGGCCGTCAT GGCCATGACC TTGTTTATTG GAATTAGCTT TATTGCTACC
CATATTCCGA TTACCCTGCT GCATGAAGGT GGCGAGAGTG TACTTTCGCA AATGACACGC
ACGATTGTTG GTAGTGGGTT TCTGTACTAC TGGGTTCAAT TTACCACCAT GTTGATTTTG
ATCCTAGCTG CTAACACTGC TTATGCCGAC TTCCCACGGA TCGCGGCCTT CTTGTCGAAT
GACGGTTTCT TGCCGCGTTG GCTCTCGCGC TTGGGTAGCC GTTTGGTCTA TAGCTCAGGC
GTGATCGCGC TGGCCTTTTT GGCCTCGGCC TTGCTGGCAG CTTTCGGCGG CGAAGAACAT
CACCTGTTGC CATTGTATGC GATTGGGGTG TTCCTCTCGT TTACGCTTTC GCAAGCTGGC
ATGATTGTGA TGTGGCGCAA AGTTGCCAAA CTCAAGCCAG GCGAAAGCCT CGACACTGGC
ATCACCACTC TGCACTATGA GCCAAACTAC AAGCTCAAAC GAATTCCCAG CATCATTGGG
GTTGGCTTGA CGGCTGTGGT TTTGGTGGTG TTGACAGTTA CCAAATTTAC CGAAGGTGCA
TGGCTGATTA TTGTGGCCTT GCCGTTGATC ATGCTTTTGT TCCGCAAGAT CAAAGCACAC
TATGACCATG TGGCAACCAA TTTGAGCCTA ACAGGCTTGA AACCAAGCGA TTTGCGTTCG
CCCGCTGATG TGGCGATTGT GCCAGTTGGC AGCATTCATC GTGGCTCGTT GCGTGCGATC
AAATATGCCT TGAAACTAAC CGACGATGTG CGCGTGGTCC AAGTTGTTGG TAGCGAAGAG
GAAGAAATCA AAACCCGCAA ACGCTGGGAA CAGTGGGACG AAGTGCTGGG CAAGGCCAAA
TTGGTCTTCT TGCACACCGA CTACCGCGAT TATCTCACGC CATTGGTCGA TTACGTCGAT
CAAGTCAACA ACAAGGAATT TCCAGGCGAT TTGATTACCG TGGTTATTCC GGAGTTTGTG
CCCGATTCGA CCATGGCCAA AGTGCTGCAC AACCAAACTG CTGTGATGCT ATTGCTGGCA
TTGCGCAAAT ATGAAGATGT GGTGGTAATC AGCGTCCCTT ATCACTTGCA CTATATTCCA
ACTGGCTCGG AAGATATTGT GGCCCAAAAA CCAGCCGCCT AA
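
The gene sequence is displayed in space-separated blocks of ten bases, so it has to be cleaned before any calculation. The sketch below shows one way to strip the whitespace and compute a GC percentage that could be compared with the GC content field in the record. The helper names are hypothetical, and the example input is only the first two blocks of the sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence, used only as a short illustration.
example = "ATGAATATCA AACAGCTACT"
print(clean_sequence(example))
print(gc_percent(example))
```

Passing the full gene sequence as one string, line breaks included, would work the same way, since `str.split()` with no arguments splits on any whitespace.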
 
Protein sequence
MNIKQLLIGK PFPSSSAQHE RLDKIRGLAV FASDPISSNA YATEAIMRVL ILISAAALSY 
TLPIAIGIAA LVLLVVLSYN QTIHHYPMGG GAYMVSKDNL GRTASWLAGA SILSDYVLTV
AVSVSAGVKA ISSAFPDVAF FHEHRVLIGI GIILLITWLN LRGVRESGTI FAIPTYAFVF
GVFVVIAIGL ARYFGIFGEP LPPRSVATDE TARSGLDNFG LIWLVLRAFA GGCTALTGIE
AISDGVQAFR NPAPKNAIIT MRAMAVMAMT LFIGISFIAT HIPITLLHEG GESVLSQMTR
TIVGSGFLYY WVQFTTMLIL ILAANTAYAD FPRIAAFLSN DGFLPRWLSR LGSRLVYSSG
VIALAFLASA LLAAFGGEEH HLLPLYAIGV FLSFTLSQAG MIVMWRKVAK LKPGESLDTG
ITTLHYEPNY KLKRIPSIIG VGLTAVVLVV LTVTKFTEGA WLIIVALPLI MLLFRKIKAH
YDHVATNLSL TGLKPSDLRS PADVAIVPVG SIHRGSLRAI KYALKLTDDV RVVQVVGSEE
EEIKTRKRWE QWDEVLGKAK LVFLHTDYRD YLTPLVDYVD QVNNKEFPGD LITVVIPEFV
PDSTMAKVLH NQTAVMLLLA LRKYEDVVVI SVPYHLHYIP TGSEDIVAQK PAA