Gene Haur_3501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3501 
Symbol 
ID5735362 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4409247 
End bp4411166 
Gene Length1920 bp 
Protein Length639 aa 
Translation table11 
GC content50% 
IMG OID641280648 
Productextracellular solute-binding protein 
Protein accessionYP_001546265 
Protein GI159900018 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000268642 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTCGAT CAAAGCGCCA ATTGATGAGC TTTGCGCTCA TGTTGGTCCT TGTTGTTCCG 
ATCCTTGCTG CTTGTGGTGG CGAAACAACT CCAACCACAG CTCCAGCAAC GACTGCCCCA
GCAACCGCAA CTACAGATAC CTCATCAGCA GCAACTGCTG AACCAACTGC TGCTGAAGCA
ACCGTCGAAC CAACCGTTGC TGAAGCAACC GCCGAACCAA GCACCGGTAC TACTCCTGAT
ATGGATAAAA CCATCATCAT CGGTATGACC CAATCGCCAG ATACCTTGTT CGGCATTGAA
TCACAATCAA GCGCCACGAC CCAAGTGTTG TCAGCGATTC AACCAGCTTG TTTCACCACT
TTGAGCTACG AATATCAACC AGTTTGTTTC ACCAAATTGC CAAGCTTCGA AGATGGCGAT
GCAGTGACCC AAACCGTCTC AGTTGATAGC GCCTATGCTG GCAACATCGT GATCGATGAT
GAGTTGATCA CCGATACCGC TAGCTTGACC GAAGCAATCG AATTGGAACA AGTTGTTGTG
ACTTGGACCT TGATCGATGG CATGACTTGG GAAGATGGTA CGCCAATTAC CGCTGCCGAC
TTCGTGTTTG CTGCTGAATT GTACCAAGAT CCAGGCATCA AGAACGCTAG CCGCTTCGTG
CTTGATCGCA CCGAAAAATA CGAAGCCAAA GACGAAAAAA CCTTGGTATG GTACGCTGCA
CCAGGTTACA CCGATGCAAC CTACTTCTTG AACACCTTTG GGCCTGAACC AAAGCACGTT
TTGGAAGGCG AAGATCCAGC AACCATTGGT GGTAGCGACT ACGCTAGCAA GCCATTGGCA
TATGGCCCAT ACAAGATTGC TGAAAACACC CCACAAGAAA GCACCAAGTT GGTTGCAAAC
GAAACCTACT GGAAAAAAGG TTTCCCTCTC GTTGGTAATG TTACCTTCAA GTATCTGACC
AGCGAAGATC AAGTGTTGCA ACAATTGGAA AGCGGCGAAA TCGACGTAGT TGGTTCAATT
GGTTTGACCT TGGCTAACGC TCCTAAGCTC GACGAACTCG AAGCTGCTGG CGTGCTCAAG
GGCCAATATG TTCCAGCAAC CGTGTGGGAA CACATGGACT TCGGTGTCGA GCGCAACGAC
GGCCAACCAT CAGTATTCGC TGATGTCAAG TTGCGCCAAG CTGTTGCTTA CGCTGTCAAC
CGCAAACAAA TCATCGATAA CGTCTTGTTC GGCAAGACCG TTGTGATGAA CACCTTCTTG
CCAGCCGACC ACTGGGCTTA TCCACCAAAC GGCGAAGGCT TGGAAGCATA CGAATATGAT
GTAGAAAAAG CTAAGGCTCT CTTGGCTGAA GCTGGTTGGG TTGCTGGCGC TGATGGCATT
CTTGAAAAAG ATGGCACCAA GCTCACCATC CAATTCTACA CCACCGAAAA CAACCAAACC
CGCGAAGCCG TTGCTCAGTT GATCCAAGAA GACCTGAAAG CTGTTGGTAT CGATGTTACC
TTGAACTTCG TTCCAGCAAC CGATGTCTTG TTCAAGAACG GCTCAGAAGG TATCTTGTCA
GGCCGCCGCT TCGACTTAGG TTTGTACGCT TGGGTCAGTG GCCCAGAGCC TTCGACCGCT
CTGTACCTCT GTGAACAAGT GCCAACCGAA GAAAACAGCT TTGGTGGTCA AAACAACACT
GGCTGGTGTA ACCCAGATTA CGACAAGCCA GCCTTGGCCT CACAATCAGA AACCGACCGC
GCCAAGCGGA TTCCTTTGGT CATCGAAGCT CAAAAAGTCT TCAATGCCGA ATTGCCAACC
TTCCCATTGT ACCAACGTGT CAATGTTGGT GCCTACAACG TCAAGGTTAG CGGCTTGGAA
TTGAACCCAA CCAGCCAAGT TGACTTCTGG AACATCGAAA CCTGGGATGT TACTGAGTAA
 
Protein sequence
MLRSKRQLMS FALMLVLVVP ILAACGGETT PTTAPATTAP ATATTDTSSA ATAEPTAAEA 
TVEPTVAEAT AEPSTGTTPD MDKTIIIGMT QSPDTLFGIE SQSSATTQVL SAIQPACFTT
LSYEYQPVCF TKLPSFEDGD AVTQTVSVDS AYAGNIVIDD ELITDTASLT EAIELEQVVV
TWTLIDGMTW EDGTPITAAD FVFAAELYQD PGIKNASRFV LDRTEKYEAK DEKTLVWYAA
PGYTDATYFL NTFGPEPKHV LEGEDPATIG GSDYASKPLA YGPYKIAENT PQESTKLVAN
ETYWKKGFPL VGNVTFKYLT SEDQVLQQLE SGEIDVVGSI GLTLANAPKL DELEAAGVLK
GQYVPATVWE HMDFGVERND GQPSVFADVK LRQAVAYAVN RKQIIDNVLF GKTVVMNTFL
PADHWAYPPN GEGLEAYEYD VEKAKALLAE AGWVAGADGI LEKDGTKLTI QFYTTENNQT
REAVAQLIQE DLKAVGIDVT LNFVPATDVL FKNGSEGILS GRRFDLGLYA WVSGPEPSTA
LYLCEQVPTE ENSFGGQNNT GWCNPDYDKP ALASQSETDR AKRIPLVIEA QKVFNAELPT
FPLYQRVNVG AYNVKVSGLE LNPTSQVDFW NIETWDVTE