Gene Caul_4009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4009 
Symbol 
ID5901471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4339907 
End bp4341127 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content68% 
IMG OID641564530 
Producthypothetical protein 
Protein accessionYP_001685632 
Protein GI167647969 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.281652 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGCTT GGTCCAGGCT GCTCGGAGGG CTGCTGGGAT TGGCGGCGCT CGCCGCCGCG 
CACGGCGTCG CTCTGGCCCA GGCGCCGGAC GCCAGCGTCC AGCATCCAGA CTATGCCGAT
CCAGGGCTGT GGCTGTGCCG GCCCGACCTG GCGGACAACC GCTGCAAGGT CGACCTCGAC
GCCACGGTGA TCGCGCCCAG CGGCAAGATG ACGGTCGAGC GCTACGTCCC GGCCAAGGAC
CCGAAGATCG ACTGCTTCTT CGTCTATCCC ACGGTCTCCA ACGATCCGGG TTGGATCTCG
GACTTCTCGC CCGACGCGGC CGAGTGGGAC GACATCAAGG TGCAGTTCGC CCGCTTCGGG
TCGGTCTGCC GGCAGTTCGC GCCGCTGTAT CGCCAGGGGA CGCTGCGGCG GCTTCGGGCG
CCGAGCGGCG GGCCGGCCCC GGTGGGGGCG CAACCGGCGC CGGGCCTTGG CGGCTTCTCG
GACGTGGTCG ACGCCTGGGC CTGGTACATG GCCAACGAGA ACAAGGGCCG GGGCGTCGTC
CTGATCGGCC ACAGCCAGGG CGGCCTCATG ATCACCCGGC TGATCGCCCA GGAGATCGAC
GGCAAGCCCG TCCAGAAGCA GCTGATTTCC GCCCTAATCC TGGGGGCGCC GGTCATGGTC
CCTCCCGGCA AGGACGTCGG CGGTTCGTTC ACGTCGGTCC CCCTGTGCCG CACCGACACC
CAGGTCGGCT GCGTGATCAC CTACGTGACT TTCCGCGACC GCCTGCCGCC GCCTTCGACC
TCGCGCTTCG GCAAGGCCCG CGACGGGCTG CGCGCCGCCT GCGTCAATCC GGCCAGCCTG
GCCGGCGGCT CGGGCCAGCC GGAGTCCTAT TTCATCACCA ACGGTTTCCT GAACGGCTCG
GGCGGCGACC TCCAGCCTGA ATGGGTGCGG CCGATGCGGC CGATCGGAAC CTTCTTCGTC
AAGGCGCCGG GGCTGGTCTC GACCGAATGC GTCGAGAGCG GTGATTTCAA CTACCTGGCC
CTGCACGTGA ACGGCGATCC AAGGGATCCG CGCACCGACG AACTGGGCGG CCAGATCATC
CGCCACACCG GCGTCGACCT GTCGTGGGGG CTGCATCTTC TCGATGTCGA TCACTCGATC
GGCACGCTGA TCCGCATCGT TCGCAGGCAA GGGGAAACCT ACGAGACGGG CGAGCGCAGA
GCGGGTTCGC ATCAATACTG A
 
Protein sequence
MKAWSRLLGG LLGLAALAAA HGVALAQAPD ASVQHPDYAD PGLWLCRPDL ADNRCKVDLD 
ATVIAPSGKM TVERYVPAKD PKIDCFFVYP TVSNDPGWIS DFSPDAAEWD DIKVQFARFG
SVCRQFAPLY RQGTLRRLRA PSGGPAPVGA QPAPGLGGFS DVVDAWAWYM ANENKGRGVV
LIGHSQGGLM ITRLIAQEID GKPVQKQLIS ALILGAPVMV PPGKDVGGSF TSVPLCRTDT
QVGCVITYVT FRDRLPPPST SRFGKARDGL RAACVNPASL AGGSGQPESY FITNGFLNGS
GGDLQPEWVR PMRPIGTFFV KAPGLVSTEC VESGDFNYLA LHVNGDPRDP RTDELGGQII
RHTGVDLSWG LHLLDVDHSI GTLIRIVRRQ GETYETGERR AGSHQY