Gene Bcav_3988 details


Gene Information

Locus tag: Bcav_3988
Symbol:
ID: 7858051
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Beutenbergia cavernae DSM 12333
Kingdom: Bacteria
Replicon accession: NC_012669
Strand:
Start bp: 4412118
End bp: 4413386
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 71%
IMG OID: 643868091
Product: extracellular solute-binding protein family 1
Protein accession: YP_002883991
Protein GI: 229822465
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
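
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a gene length that includes the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp):
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(4412118, 4413386)
print(length)                           # 1269
print(expected_protein_length(length))  # 422
```

Both results agree with the Gene Length and Protein Length fields listed above.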


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAACGC GCGTTAGTCG AAGGACGGTC CTCGGTGCGC TCGCCGCCGG GGGAGCCGGC 
GCCGGTCTCC TCGCCGCGTG CGGGGCGCCC AACCAGCCGC AGTTCGCGGC CTCCGGAGCG
CGCATCCTCA CCATGTGGGG CGGCTGGGCC GGCGATCAGG CGGGGCAGAT CCAGACGCAG
CTCGACGCGT TCAACGCCTC GCAGGGCGAC TACGAGTTCC GGTACGTCCC GCAGGGCGCG
ATGGAGCAGA AGCTCCTCAC GGGCATCGCT GGCGGGAACA TCCCCGACCT CGTGCTCTGG
GACCGTTGGC AGACGGCGAA GTACGCCCGG CGGGGTGCGC TGCAGTCGAT CGACGCGTGG
GCGGAGCGCG ACGGGTTCGA CCTCAACGCG TTCTTCCCGG AGTCGATGCG CGAGATGCAG
GTCGAGGGCG AGACGTTCGG CCTGCCGCTG CTCGTGGACG CTCGCTCGAT CTTCTACAAC
CGCGCGCACC TCGACGAGGC GGGCATCGCG CCACCGACCA CGTGGGACGA GCTGGCCGAC
GCCGCTGAGG CGCTGACGAC CCGGGAGGGC GGCGCGCTGA AGCGGGCCGG GTTCGAGATC
CAGGACGTCG GGCTGTTCAG CATGTACATG TACCAGGCGG GCGGCGAGAT GCTGGACGCC
TCGTTCACGC GCACGGCGTT CGACGCGCCG GAGGGGCTGG ACGTCCTGAG CTTCTGGGAC
GAGCTGCTCC ACGAGCGTCA GGTGTACGAG CTCGGGTTCT CCGACGGGAT CGACGCCTTC
GCGCAGGGGA TCGTCTCGAT CAAGTACGAC GGGCCGTGGC AGCTCCCCAC CTGGGACGCT
GTCGAGGGTC TCGAGTACGG GATCGTGCCG GCCGTGGCCG GGCCTCGCGG CGACCAGGGC
GCCGGTCTCG GCGGCTTCGG GCTCATCATC CCGACCGGCG CGCCGGACCC CGAGGGTGCG
TGGGAGCTCA TGAAGTGGTG GGCGGGGGAG TCGGCGAACA ACGTGGCGTT CAGCGAGATC
AGCGGCTGGA TCCCGGCCAA CGTCGAGGCC GCGAACGATC CCTACTTCGT GGCCGACGAA
CGCTTCGCGC CGCTCGTGGA GACCATCTCC GTGGCACGGA TCCGCCCACC CGTGCCCGGC
TACTCCGACG TCGAGGGTCT CGCGCTCATC CCCGCCCTGC AGCAGGTGAT GTCCGGGACC
CTCTCGGGCG AGGCGGCGCT CGCGCAGGCG CGAGAGCAAG GCGACCGGAT CCTGGAGGCC
AATCGATGA
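
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any analysis. The following is a minimal sketch of that cleanup, plus a GC-fraction calculation and a start/stop codon check. The helper names are hypothetical, and the start codon list (ATG, GTG, TTG) is an assumption covering the common bacterial initiators rather than every start permitted by translation table 11.

```python
def clean_sequence(text):
    """Join a blocked sequence listing into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def has_start_and_stop(seq,
                       starts=("ATG", "GTG", "TTG"),   # assumed common starts
                       stops=("TAA", "TAG", "TGA")):
    """True if the sequence begins with a start codon and ends with a stop."""
    return seq[:3] in starts and seq[-3:] in stops


# Only the first and last lines of the listing, for illustration;
# paste the full listing to analyse the whole gene.
excerpt = """ATGTCAACGC GCGTTAGTCG AAGGACGGTC CTCGGTGCGC TCGCCGCCGG GGGAGCCGGC
AATCGATGA"""
seq = clean_sequence(excerpt)
print(has_start_and_stop(seq))  # True
```

Running `gc_fraction` on the full cleaned sequence gives a value that can be compared with the GC content field in the Gene Information section.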
 
Protein sequence
MSTRVSRRTV LGALAAGGAG AGLLAACGAP NQPQFAASGA RILTMWGGWA GDQAGQIQTQ 
LDAFNASQGD YEFRYVPQGA MEQKLLTGIA GGNIPDLVLW DRWQTAKYAR RGALQSIDAW
AERDGFDLNA FFPESMREMQ VEGETFGLPL LVDARSIFYN RAHLDEAGIA PPTTWDELAD
AAEALTTREG GALKRAGFEI QDVGLFSMYM YQAGGEMLDA SFTRTAFDAP EGLDVLSFWD
ELLHERQVYE LGFSDGIDAF AQGIVSIKYD GPWQLPTWDA VEGLEYGIVP AVAGPRGDQG
AGLGGFGLII PTGAPDPEGA WELMKWWAGE SANNVAFSEI SGWIPANVEA ANDPYFVADE
RFAPLVETIS VARIRPPVPG YSDVEGLALI PALQQVMSGT LSGEAALAQA REQGDRILEA
NR
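
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below uses the standard amino acid assignments, which translation table 11 shares with the standard genetic code (the two tables differ in permitted start codons, not in codon meanings). The `translate` function is a hypothetical helper, and it does not handle alternative start codons, which are read as methionine at the first position.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG
AMINO_ACIDS = ("FFLLSSSSYY**CC*W"
               "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR"
               "VVVVAAAADDEEGGGG")

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq):
    """Translate a nucleotide string codon by codon, stopping at the first stop."""
    seq = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above
print(translate("ATGTCAACGC GCGTTAGTCG AAGGACGGTC"))  # MSTRVSRRTV
print(translate("AATCGATGA"))                          # NR
```

The two outputs match the first ten residues and the final two residues of the protein sequence listed above.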