Gene Caul_3142 details


Gene Information

Locus tag: Caul_3142
Symbol:
ID: 5900597
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3406346
End bp: 3407506
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 68%
IMG OID: 641563645
Product: hypothetical protein
Protein accession: YP_001684767
Protein GI: 167647104
COG category: [S] Function unknown
COG ID: [COG3146] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
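
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that the start and end coordinates are 1-based and inclusive and that the protein length excludes the stop codon; the record itself states neither. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3406346
END_BP = 3407506


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the final codon is a stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp)  # 1161
print(length_aa)  # 386
```

Under these assumptions both results agree with the Gene Length and Protein Length fields listed in the record.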


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0531471
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACAGCCG TCAAGGCGGA GGTGCGCGTC CATCGGCGCA TCGCCGAGAT CGGGCGCGAC 
GCCTGGGACG CCTGCGCCGC GCCGTCGGGC GATCCGTTCG TCAGCTACGA TTTCCTCGAT
GCGCTGGAAG AGAGCGGTTG CGCCGTCGAA CGCACCGGCT GGGCGCCGCA GCATCTTTCC
GTCCAGGACG AGACCGGCCG CGTGGCGGCG GTCATGCCGC TTTATTTGAA GTCCCACAGC
CAGGGCGAAT ACGTCTTCGA CCACAGCTGG GCCGACGCCT ACGAGCGAGC CGGCGGACGC
TACTATCCCA AGCTCCAGTG CTCGGCGCCG TTCTCGCCGG TCACGGGGCC CCGGCTGATC
GTGCGGCCCG ATATCGACAT CGACGACGGT CGCTCGGCCT TGCTTGGCGG GGCGCTGACC
CTGTGCGACC GGCTGAACGC CTCGTCGCTG CACGTGACCT TTCCGAAGGC CGACGAGTGG
GAATGGATGG GCGAGCGGGG CATGCTGCTT CGCCAGGACC AGCAGTATCA CTGGTTCAAC
AACGGCTACG CGACCTTCGA CGACTTCCTG GCGGCCCTGT CGTCCAACCG CCGCAAGACC
ATCCGCCGCG AGCGTCGGGA CGCCCAGGCG GGCCTCGAGA TCGTCGCCCT GACCGGCGCC
GAGCTCACCG AGGACCACTG GGATGCTTTC TTCGGCTTCT ACATGGACAC CGGCGGTCGC
AAATGGGGGC GGCCCTATCT GAACCGGGCG TTCTATTCGC TGCTGGGCGA GCGGATGGCC
GAAAAGGTGT TGCTGATCCT GGCCCGCCGT CCAGGCGGTC CGTGGATCGC CGGGGCGCTG
AACCTGATCG GCGGCGATTG CCTCTATGGC CGCCACTGGG GCTGCACCGA GGACGTGCCG
TTCCTGCACT TCGAGCTCTG CTACTATCAG GCGATCGAGC ACGGCATCCG CCTGGGCCTG
CCGCGGGTCG AGGCGGGCGC CCAGGGGCAG CACAAGATCG CTCGCGGCTA TCTGCCCAGC
CCGGTCTATT CGGCCCACTG GATCGCCGAT CCGGCCCTGC GCGAGCCGGT GGCCCGCTAT
CTGGAGCGTG AGCGCGAAGC GGTCAGCGCC GAGATCGAAA TGCTGACCGA GGAATTCTCG
CCGTTTCGGC ACGAGGGGTA G
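
The gene sequence is laid out in space-separated blocks of 10 bases, so the whitespace has to be removed before anything is computed from it. The following is a minimal sketch of how a GC content figure like the one in the Gene Information section could be derived, assuming it is simply the fraction of G and C bases over the full sequence. The helper names are hypothetical, and the record does not say how its own value was calculated or rounded.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Usage: paste the full gene sequence, spaces and line breaks included,
# in place of this first 10-base block.
first_block = "GTGACAGCCG"
print(f"{gc_content(first_block):.0%}")
```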
 
Protein sequence
MTAVKAEVRV HRRIAEIGRD AWDACAAPSG DPFVSYDFLD ALEESGCAVE RTGWAPQHLS 
VQDETGRVAA VMPLYLKSHS QGEYVFDHSW ADAYERAGGR YYPKLQCSAP FSPVTGPRLI
VRPDIDIDDG RSALLGGALT LCDRLNASSL HVTFPKADEW EWMGERGMLL RQDQQYHWFN
NGYATFDDFL AALSSNRRKT IRRERRDAQA GLEIVALTGA ELTEDHWDAF FGFYMDTGGR
KWGRPYLNRA FYSLLGERMA EKVLLILARR PGGPWIAGAL NLIGGDCLYG RHWGCTEDVP
FLHFELCYYQ AIEHGIRLGL PRVEAGAQGQ HKIARGYLPS PVYSAHWIAD PALREPVARY
LEREREAVSA EIEMLTEEFS PFRHEG
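
The gene begins with GTG, yet the protein begins with M. This is consistent with translation table 11 (the bacterial, archaeal and plant plastid code), in which GTG is an alternative start codon that is read as methionine at the initiation position. The following is a minimal sketch of that translation, assuming the standard NCBI layout of table 11; the function and variable names are illustrative and are not taken from the source database.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for translation table 11 (assumed from the NCBI table).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(text: str) -> str:
    """Translate a CDS, reading a valid start codon as M and stopping at '*'."""
    seq = "".join(text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for index in range(0, len(seq), 3):
        codon = seq[index:index + 3]
        amino_acid = CODON_TABLE[codon]
        if index == 0 and codon in START_CODONS:
            amino_acid = "M"  # initiator codon is read as methionine
        if amino_acid == "*":
            break  # stop codon ends the protein
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "GTGACAGCCG TCAAGGCGGA GGTGCGCGTC CATCGGCGCA TCGCCGAGAT CGGGCGCGAC"
print(translate_cds(first_line))  # MTAVKAEVRVHRRIAEIGRD
```

The output matches the first two blocks of the protein sequence above (MTAVKAEVRV HRRIAEIGRD).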