Gene Caul_4202 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4202 
Symbol 
ID5901664 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4565729 
End bp4567168 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content69% 
IMG OID641564724 
Producthypothetical protein 
Protein accessionYP_001685824 
Protein GI167648161 
COG category[R] General function prediction only 
COG ID[COG2270] Permeases of the major facilitator superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.246317 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGAGT TCTCGGCCGC CGCGGCGGTT GCGGCGGAGG GTGAACCGAT CGCCCCGCTG 
AAGTCCGGCC TGTCGCGCGG CGCGCTCGCC TGGATCCTGC AGCAGGGCGC GCGCGACCCC
TATGTGATCC TGATCACCAT CTACATCTTC TCGCCCTACT TCTCGCGGGT GCTGGTCGGC
GATCCGGTCA AGGGTCAAGC CGTGGTGGCC AATCTCTCGA CGATCTACGG GGTGCTGACC
GCCCTGACCG CCCCGCTGCT GGGGGCGATG ATCGAGCAGT ACGGACCGCG CAAGCCGATG
CTGGGCCTGG TGCTGGGAGT GATGGTCCCG GCGCTGGCGG CCCTGTGGTG GGCCATGCCG
GTAGGCGGAC TGCCGCTGAT GGTGACCAGC GCCGCGCTGA TCGTGCTGGG CCTGGTCTAT
AATTGGGGCG ACGTGCTGAG CAATTCGCTG CTGGGCCGGG CCGCTGGTCC CGTTCCAGGT
CGCGCCGCCC TGGTCTCGGG CCTGGGCTAC GCGGTGGCCA ACGGCCTGTC GGTGGCGCTG
CTGGTGTTCA TGCTCTGGGG CATGGTGCTG CCGGGCCAGG TTAACTGGCC GGGCGTGCCG
CACGCGCCGC TGTTTGGCCT GGACGCCAGC AAGAACGAAC CCAGCCGGAT CTCCGGTCCG
ATGGCGGCGG CGGTGATGCT GCTGGGCGCG ATCCCTTTCT TCCTGTGGAC CCCCGACGCC
GCGCGCACCG GCCGCAGCTG GATGGCCAGC ATGCGGGCCG GGATCGCCAT GCTGCGCGAC
ATCTTCGGCA ATCTGCGGGG CCATCGCGAC GTCGCCCTCT TTCTGGGCGG ACGCATGCTC
TACTGCGACG GCATGACCGC GCTGCTGGTG TTCGGCGGGC TGCTGGCGGC GGGCCTGATG
CGCTGGGGCG CGCTGGAGAT GCTGGCCTAC GGTATCTGCC TGAGCATCTT CGGCGTGGTC
GGCGGCCTCG TCGCGCCGTG GTTCGACCGC ACCCTGGGTC CGCGCAAGGC GGTGCAGCTG
GAGATCGCCG CCTCGCTGCT GATCCTGATC GCCACCCTGG GCATGGGGCG CGAGAAGATC
CTCTATTTCT GGGCCTACGA TCCGGCCGCC CATGCTCCGG TCTGGAACGG GCCGCTGTTT
CGCACGGCGC CGGAGCTGGT CTATCTGGGC CTGGGCCTGC TGATCGCGGT GTTCGTCACC
GCGCAGTACG CCTCCAGCCG CACCCTGCTG ATCCGCCTGT GCCCGCCCGA CAAGACGGCG
GCGTTCTTCG GCCTCTACGC GCTGTCGGGC ACCGCCACCA TGTGGATCGG CTCGCTGCTG
GTCGCCCTAG CCACCGCCAT CTTCAAGAGT CAGATCGGCG GCTTCCTGCC GGTCGCGGCC
CTGCTGCTGC TGGGCTTCTG CGTGCTGTTC TGGGTCAAGG GCGGCGAGCG CGAGGGCTGA
 
Protein sequence
MSEFSAAAAV AAEGEPIAPL KSGLSRGALA WILQQGARDP YVILITIYIF SPYFSRVLVG 
DPVKGQAVVA NLSTIYGVLT ALTAPLLGAM IEQYGPRKPM LGLVLGVMVP ALAALWWAMP
VGGLPLMVTS AALIVLGLVY NWGDVLSNSL LGRAAGPVPG RAALVSGLGY AVANGLSVAL
LVFMLWGMVL PGQVNWPGVP HAPLFGLDAS KNEPSRISGP MAAAVMLLGA IPFFLWTPDA
ARTGRSWMAS MRAGIAMLRD IFGNLRGHRD VALFLGGRML YCDGMTALLV FGGLLAAGLM
RWGALEMLAY GICLSIFGVV GGLVAPWFDR TLGPRKAVQL EIAASLLILI ATLGMGREKI
LYFWAYDPAA HAPVWNGPLF RTAPELVYLG LGLLIAVFVT AQYASSRTLL IRLCPPDKTA
AFFGLYALSG TATMWIGSLL VALATAIFKS QIGGFLPVAA LLLLGFCVLF WVKGGEREG