Gene Caul_4786 details


Gene Information

Locus tag: Caul_4786
Symbol:
ID: 5902248
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 5167729
End bp: 5168949
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 69%
IMG OID: 641565306
Product: hypothetical protein
Protein accession: YP_001686404
Protein GI: 167648741
COG category: [S] Function unknown
COG ID: [COG2311] Predicted membrane protein
TIGRFAM ID:
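
The coordinates and lengths listed above are mutually consistent, which a little arithmetic can confirm. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this) and that the last codon of the CDS is a stop codon. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming its final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5167729, 5168949)
print(length, protein_length(length))
```

Under those assumptions the sketch reproduces the 1221 bp gene length and 406 aa protein length given in the record.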


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value: 0.6743
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCAAGG ATCGTATCGT TCTGCTGGAC TCCCTGCGGG GCCTGGCGGT GTTGGGCATC 
CTCCTCTGCA ATATTCCCCT GGTGGCGGTC CCAGAGGCGG TCGGGGTCAG CCTGACCCTG
TGGCCGCACG GCATGGCGCC GGCCTCGGTG GCGGTCTGGC TCGTCACCCA GCTGTTCTTT
CAGCAGAAGT TCTACTCGCT GTTCGCCATG CTGTTCGGCG CCTCGATCCT GCTGGTCGGC
GGCGAGGGCG GCGATGGCGA CCGTCGCAGG ATCCTGATCC TGCGCCTGGT CAGCCTGCTG
GCCATCGGCC TGTTCCACGG CTTCGTCATC TGGCAGGGCG ACGTGCTCAA CACCTATGCG
ATCGTCGGCC TGCTGGCGAT GTGGGCGCGC TCGTGGCCGG CCAAGCGCCT GCTCCAGGCA
GGGATCGGCC TGCATCTTGG TCTGTCGGCC TGGAGCGGCT GGAACCTGCT GACGCGCGTC
GCCAAGGGCG GCGGCGATCC GCCCCCCGCG GCCATGGCCA AATACCTGGC TGAAGCCCAG
GCCGACGGCG CGCAATTTGC GGGAACCTTC GCCCAGTCCC TGGTCCAGAA CGCCAAGGAC
TATGGCGAGT TCGTGGTCGG GTCGTTCACC CACTGGCCGC CGACCTGGCC GCTGCTGGTG
CTGTCGCTGA TCCTGATCGG CATGGGCCTC TACAAGCTCG GCGTCCTGAC CGGCAAGGCC
TCGACGGGCC TCTATCAGGG GCTGATTGGC GCGGGTCTCG GCGCGCTGGT GCTCGCCGGC
ATGGCCGAGA CGATATACGT GCTGCTGCCG AGCCACGACT GGACGATCCG CGGCGTGGCC
CGCTGGCTGC AGAGCGCCAC CGCCCCGGTG GTCACCCTGG GCTATGTGGG CCTGATGGTG
CTGGCGACGC GGACCCGGGT CTGGAAGGCA ATCCCCGCCG TGCTGGCCCC GGTCGGCCAG
ATGGCCTTCA CCAACTATCT GACCCAGTCG ATCCTGATGA CCGTGTTGCT GTATGGCGGG
CGCGGGCCGG GCCTGTACGG CAAGGTCGAT CGCCCCGCAC TGGCCTTGGC GGTCCTGGCC
ATCTGGACCC TGCAGATCCT GTGGTCGCGC TGGTGGATGG CGCGCTTCAC CATGGGGCCG
CTGGAGTGGC TGTGGCGGCT GGCCTATCGC GGGCCGATGC CGCTGCGTCG CGCGCCGGCG
ACGGCTGCGG TCACGGCCTA G
 
Protein sequence
MVKDRIVLLD SLRGLAVLGI LLCNIPLVAV PEAVGVSLTL WPHGMAPASV AVWLVTQLFF 
QQKFYSLFAM LFGASILLVG GEGGDGDRRR ILILRLVSLL AIGLFHGFVI WQGDVLNTYA
IVGLLAMWAR SWPAKRLLQA GIGLHLGLSA WSGWNLLTRV AKGGGDPPPA AMAKYLAEAQ
ADGAQFAGTF AQSLVQNAKD YGEFVVGSFT HWPPTWPLLV LSLILIGMGL YKLGVLTGKA
STGLYQGLIG AGLGALVLAG MAETIYVLLP SHDWTIRGVA RWLQSATAPV VTLGYVGLMV
LATRTRVWKA IPAVLAPVGQ MAFTNYLTQS ILMTVLLYGG RGPGLYGKVD RPALALAVLA
IWTLQILWSR WWMARFTMGP LEWLWRLAYR GPMPLRRAPA TAAVTA
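
The protein sequence corresponds to the gene sequence read through translation table 11, and the GC content is the share of G and C bases in the gene sequence. The following is a minimal sketch in plain Python, assuming the sequences are pasted with the spaces and line breaks shown above. The helper names are hypothetical, and the codon table is written out by hand rather than taken from a library. Table 11 assigns the same amino acids to codons as the standard code; its alternative start codons are not modeled here.

```python
from itertools import product

# Codon table in TCAG order. Table 11 shares these codon-to-amino-acid
# assignments with the standard code; alternative start codons are ignored.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence above.
print(translate("ATGGTCAAGG ATCGTATCGT T"))  # MVKDRIV
```

Passing the full gene sequence to `translate` and `gc_content` would allow the listed protein sequence and GC content to be checked against the record.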