Gene Caul_0505 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0505 
Symbol 
ID5897960 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp550144 
End bp551493 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content67% 
IMG OID641560988 
Productmajor facilitator transporter 
Protein accessionYP_001682137 
Protein GI167644474 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGTGC AAACCGCGTC TTTCGCTCCA TCGACCGTCC ATCCCGGGCG GCTTCTGGCC 
CTGCTCTGCT TCGCCTATCT GCTCGGCTTC CTGGACCGGA TCATCTTCAG TCTGGCGGTC
CCGGCCATCA AGGCGCAGTT GCTTCTCAAC GACCAGCAGC TGGGCCTGTT GTCGGGGCTC
GCCTTCGCTG TCAGCTACGC CTTGTTCGCG CCAGTGGCGG GCTATTTCGC GGACCGTCGG
TCGCGCAAGC AGATCCTGAT GTACGCCGTC GCGGTCTGGA GCCTCGCCAC CGCGGCGACG
GCGCTGGCGG ATTCGTTCTG GACGATGTTC GCCGCCCGCG CCGTCGTGGG GGTGGGGGAA
GCGACGCTCA TTCCGCTGGC CGTATCCTTG ATCAGCGATA CCCGGACGGG TCACTCCCGC
GACCGGGCGT TCGGCATGTT CCTCGCCGCC GGCGCGGTGG GCAACACCGC AGCCCTGCTG
TTTGGCGGCG CCATCATCCA TTTTGTCACA CGGGCCGGCG GTCTTCATCT GCCCGCGATT
GGCGCTGTGT CTGGCTGGCA GAGCCTGTTC CTCGCCGCGG GCGCCGCTGG CGTCCTGCTG
GTCGCGGTCA TCGCCGCGCT CATGCGCGAT CCGCCGCGCG CTGGTCCTTC GGCGGCGGCG
CCCGCGTCGG ACGCCGGATC GGCCTGGGCC TTCGTTCGCA GCCATCCGAT GCTGATCGCC
ACCCTCTATT TGGGCTTCTC GCTCGTGCAA ATGGCGACTG TGGCGACGCC GTCATGGCTG
ATCGCCACCC TGGTGCGATC TCATGGCTGG TCGGCGGGTG AGACCGCCGT GCGGCTGGGA
CTGACGGCGG GCGTCACCCT GATCGTCGGC GCGGTCGCGA TCGGCCCGCT GATCAAGGCC
GTACGCCAGC GCGGTCACGC CAACGCCGCC CTTCTCGTCG CTCTGCTCTG CGTCGTCAGT
TTCGCCGTCT TTCTCGTCGC AGGCCTGTTC GCGACCGGAA CGGCGCCTGT GCTGACCCTC
GTCGCGATCG CCTTCTTCTT TGGATACACC CCGACCGTCT GCTCCTACGT CATGATGGGG
GAGGTCCTGC CGTCCCATGT CCGAGCCCAA TTGGCCGGCA TCAATACCTT CTCCAACGCC
CTGATCTGCA ACTCGCTGGC GACCTATCTG GTTGGGCTGC TCAGCGACAA GGCCTTCCCC
GGGCCCAGGG GGCTGGCCGT GTCGCTGAGC GTCGTCGTCG TCGCCTCGGT AGTCTTGGGG
TCGCTGGTGA TCCTGCTGGG GCTACGCGCC TATTCCTCGC GGATGAAGGA GTTCGCCGAG
GTCGCTTCCG CGCCCTCGTC GGCGAAATAG
 
Protein sequence
MSVQTASFAP STVHPGRLLA LLCFAYLLGF LDRIIFSLAV PAIKAQLLLN DQQLGLLSGL 
AFAVSYALFA PVAGYFADRR SRKQILMYAV AVWSLATAAT ALADSFWTMF AARAVVGVGE
ATLIPLAVSL ISDTRTGHSR DRAFGMFLAA GAVGNTAALL FGGAIIHFVT RAGGLHLPAI
GAVSGWQSLF LAAGAAGVLL VAVIAALMRD PPRAGPSAAA PASDAGSAWA FVRSHPMLIA
TLYLGFSLVQ MATVATPSWL IATLVRSHGW SAGETAVRLG LTAGVTLIVG AVAIGPLIKA
VRQRGHANAA LLVALLCVVS FAVFLVAGLF ATGTAPVLTL VAIAFFFGYT PTVCSYVMMG
EVLPSHVRAQ LAGINTFSNA LICNSLATYL VGLLSDKAFP GPRGLAVSLS VVVVASVVLG
SLVILLGLRA YSSRMKEFAE VASAPSSAK