Gene Caul_2075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2075 
Symbol 
ID5899530 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2221437 
End bp2222744 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content68% 
IMG OID641562564 
Productmajor facilitator transporter 
Protein accessionYP_001683701 
Protein GI167646038 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.560518 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGATC CGTCCATCGA CATGGCCCCC GACCTGGAGC GCGCCACCGT CGGCCGGGTG 
ACAAGGCGGC TGATGCCGCT GTTCTGCCTG ATGTACCTGA TCGCCTACAT CGATCGGCAG
AACGTCTCGT ACGCCAAGCT CGACATGGTC CATGCCCTGG GCCTCACCGA GGCGGCCTAC
GGCCTGGGCG CGTCGCTGTT CTTCATCGGC TATTTCCTGT TCGAGGCGCC GTCCAACCTG
ATCCTGGCCC GGGTGGGCGC GCGGGTCTGG TTCGCGCGGA TCATGTTCAC CTGGGGTCTC
GTCACCCTGG CCCTGGGTTT CACCCAGAAC GCGACGATGT TCTACGTTCT GCGCTTCCTG
CTGGGGGTCA CCGAGGCGGG GTTCTTTCCG GGCGTGCTCT ATGTCCTGAC CCTTTGGTAC
CCGCAGGCCC ATCGGGGCCG GATGGTGGGC TTGTTCATGA TCGCCAGCGC CGTCGCCAAC
GCCGTCGGGG CGGTGTTGGG CGGCCTGCTG CTGGATCTGG ACGGAACGCT GGGACTGGCG
GGCTGGCAAT GGGTGTTCCT GGTCACCGGC GTTCCGGCCG TCCTGCTGGC GCCCTATGTC
CTGTGGCGGC TGCCGGACGG TCCGACCAAG GCGCGCTGGC TGCCGGAGGC CGAACGGGCC
TGGTTGGCCA AGGTTCTGGA TACGGAGCGG GGCGGGGTGG TCGATGATCA TCGCGGCGCC
TGGAAGGCGA TCTTCGACCC GCGCGTGCTG CTGCTGGCGG GCCTCTATAT CGGCATGCCG
CTGGGCGCCT ACGGCCTGAG CTACTGGCTG CCGACCATCG TCAAGTCGTT CGGCGTCTCC
AACAGCGTGA ACGGCCTGAT TAATGTCATC CCCTGGCTGC TGGTCGCTGT GGCCCTGTGG
TTCGTGCCCC GCCACGCCGC GCGCCATGGC GCCAGCGCCT GGCACATCGC CGGACCGTGC
CTGCTCGGCG CCCTTGCCCT GGTCTTGAGC GTGATCGTGC CGGGCTCGGC GTTGAAGTTC
GCCATGTTGT GCCTCGCCGC TCCGGCCATC TTCGCGGCCC AGCCGGTGTT CTGGAGCCTG
CCGCCGAGCT TCCTCAGCGG ACCGAGGGCG GCGGCGGGCA TCGCGGCGAT CAACGCGATC
GGCAATCTGG GCGGCTTCAT CGCCCAGAAC CTGGTGCCGA TGGTGCGCGA CGCGACAGGC
AGCAACCTGG CGCCCATGCT CGCCCTGGCC GCCGTGCTGG TGGTGACCAG CATCCTGATC
TTCTACGCCA TGGCCCGGCT GAACCGCGTG CGATCCAGCG CCGGGTGA
 
Protein sequence
MPDPSIDMAP DLERATVGRV TRRLMPLFCL MYLIAYIDRQ NVSYAKLDMV HALGLTEAAY 
GLGASLFFIG YFLFEAPSNL ILARVGARVW FARIMFTWGL VTLALGFTQN ATMFYVLRFL
LGVTEAGFFP GVLYVLTLWY PQAHRGRMVG LFMIASAVAN AVGAVLGGLL LDLDGTLGLA
GWQWVFLVTG VPAVLLAPYV LWRLPDGPTK ARWLPEAERA WLAKVLDTER GGVVDDHRGA
WKAIFDPRVL LLAGLYIGMP LGAYGLSYWL PTIVKSFGVS NSVNGLINVI PWLLVAVALW
FVPRHAARHG ASAWHIAGPC LLGALALVLS VIVPGSALKF AMLCLAAPAI FAAQPVFWSL
PPSFLSGPRA AAGIAAINAI GNLGGFIAQN LVPMVRDATG SNLAPMLALA AVLVVTSILI
FYAMARLNRV RSSAG