Gene Caul_4600 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4600 
Symbol 
ID5902062 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4973526 
End bp4974854 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content68% 
IMG OID641565119 
Productmajor facilitator transporter 
Protein accessionYP_001686218 
Protein GI167648555 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.578788 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.252114 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAGCA ACGCCGAAGA AGCCTTGCCC TTGCCAGCGG CGTCGACCCT CGTCCGGGGG 
GCCACGCCCG ACGGGCGGGC CTGGGGCGCC CTGGGGTGTC TCTGGTTCAT CTACGTCCTG
AACTTCCTGG ATCGGCAGCT GCTGTCGATC CTGGCCAAGC CCATCCAGGA CACGCTGCAC
ATCACCGATG GCCAGCTTGG GCTCCTGGGC GGTCTCTATT TCGCGGCGTT CTACTGCTTC
ATCGCCATTC CCGTGGGCTG GCTCGCCGAC CGAACCAACC GCGTGACCGT TCTGGCCATC
GCCTGCGGCG TCTGGAGCGC GGCGACCATG GCGTGCGGCG TGGCGGGGTC GTATGGCGCT
TTCGCCGCCG CGCGCATGAC CGTCGGCTTT GGAGAGGCGG GCGGGGTCCC GCCGTCCTAT
GCGATCATCA CCGACTATTT TCCGCCGGGA CGCCGGGGCA GGGCGCTGGG CCTCTACAAC
CTCGGCCCGC CGGTCGGCGC GGCCCTGGGC ATTGCGTTTG GAGCCTCCGT GGCGGCGGCG
TTCAGCTGGC GCGACGCCTT TCTGGCCACT GGCGCCGTCG GTCTTGTCGC AGCCCTGGCG
CTGAAGTTCA TCGTGCGGGA GCCCCCGCGC GGCGGACTTG ATCGAGCGGT GGGCGAGGCG
GCTCCGCCCC GGGCGGGCTT TGGCGAAACC CTGCGGATGT TCTTCTCCCA TCCAGCGTTG
CTGCTGGTTT CCCTGGGCAG CGGCGCGACC CAGTTCGTCA CCTACGGCCT GGGCAATTTC
GCCACCCTGT TCCTGATGCG CGAGAAGGGC ATGACCCTGG GCGAGGTCGC CGTCTGGTAC
GCCGTGGTCG TGGGCGTGGG CATGAGCGTG GGCATCTTCG GGTCCGGATG GCTTATCGAC
CGCTTCACCC GCACATCGAG GCGCGCCCAC GGCCTGGCGC CGGCCATCGC CCTGATCCTG
GCCGTCCCCT GCTATCTGGC CTTCGTCTGG GCGCCCACCT GGCCGGTGGC GCTGGCGTTC
CTGCTCCCGG CGATGTGCCT GAACCACGTC TATCTGTCGT CCGCCGTCAC CCTGGTCCAG
ACCGAGGTCA GGCCCAATCA GCGGGTGATG TCGGGCGCCC TTCTGCTGCT GGTGATGAAC
TTCATCGGCC TGGGCCTTGG CCCCACCTAC GTCGGGGCGG CGAGCGATTT CTTCCGTGGC
GCGCACCCCG GCAACTCGCT GCAGATCGCG CTCTACACGC TCGCGCCGTT CTACGCCCTG
GCCATCGGCC TGTTCCTCTG GCTGGCGCAT ATCCTCGGCA GGTCGTCCAA AGTCGGAGCT
CTTCAATGA
 
Protein sequence
MMSNAEEALP LPAASTLVRG ATPDGRAWGA LGCLWFIYVL NFLDRQLLSI LAKPIQDTLH 
ITDGQLGLLG GLYFAAFYCF IAIPVGWLAD RTNRVTVLAI ACGVWSAATM ACGVAGSYGA
FAAARMTVGF GEAGGVPPSY AIITDYFPPG RRGRALGLYN LGPPVGAALG IAFGASVAAA
FSWRDAFLAT GAVGLVAALA LKFIVREPPR GGLDRAVGEA APPRAGFGET LRMFFSHPAL
LLVSLGSGAT QFVTYGLGNF ATLFLMREKG MTLGEVAVWY AVVVGVGMSV GIFGSGWLID
RFTRTSRRAH GLAPAIALIL AVPCYLAFVW APTWPVALAF LLPAMCLNHV YLSSAVTLVQ
TEVRPNQRVM SGALLLLVMN FIGLGLGPTY VGAASDFFRG AHPGNSLQIA LYTLAPFYAL
AIGLFLWLAH ILGRSSKVGA LQ