Gene Caul_1356 details


Gene Information

Locus tag: Caul_1356
Symbol:
ID: 5898811
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 1440903
End bp: 1442123
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 68%
IMG OID: 641561843
Product: hypothetical protein
Protein accession: YP_001682984
Protein GI: 167645321
COG category: [S] Function unknown
COG ID: [COG5330] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
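
The length fields in this record can be cross-checked against the coordinates. The sketch below is only an illustration and rests on two assumptions not stated in the record: that the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1440903
end_bp = 1442123

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp)     # 1221
print(protein_length_aa)  # 406
```

Under these assumptions both values agree with the Gene Length and Protein Length fields.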


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.368501
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGAAC ACGCCGCGCC TTTCGCCAGT CCCGAGCCCG AGCCGGCCGT CGCGCACCGC 
TCGCGCGCCG CCCTGCTCAA GCGGCTGGCC GACGTGGTTT GCCTGCCGGC CAGCCGCATC
AACGCCTTCG AACGGTCGAT GACCGCCGAC CTGCTGGTCG AGATGCTGCG CGACGCCGTC
ACGGTCGAGC GCGAGAAGGT CGCTCGCCGC CTGGCCAACC TGGCCGAGAT GCCTGGCGTG
CTGGTCCGCA TGTTGCTGCG CGACGAATTG CCGGTGGCCC GCGCCCTGCT GGAAAACTCG
CCCAGCCTCA GCGACGCCGA CCTGATCAGT TGCCTCTACA ACGCCACCCA GGACCATCGC
CGGCTGATCG CCCTGCGGCG CGGGGTCAGC GAGGTGGTGG CCGACGCCCT GGTCGACATG
GACGAGACGG CGGTCACCGA GACCCTGCTG AAGAACGAGC TGGCGCGCTT CAGCCACCAA
GGGCTGGAAA ACATCGTCGC GGCCACGCGC GATAATCCGC AGCTGATTCC CCTGCTGCTC
AAGCGCTCCG AACTGCGGCC TAGCCACGCC TATGTGATGT TCTGGTGGTC GGACGCCGAC
GCCCGCCGGA CGATTTTGCA GCGCTTTGCG GTGTCGCGCG AGATTCTTCA GGACGCGGTC
GGCGATGTTT TCCCCCTGGC CTCGGCCGAG GGCTGGCAGG ACCCGCTTTC GCGAAAGGCG
TTGCAGTTCA TTGAACGTCG TCAACGAAAT CGCGCCGCTA TCGCCAAGAG TCCTTATGAC
AGTCTCGAGG CCGCGATCGC CGCCGCCCAG AACGGCATGA CCCGCGAAAC CGCCGAGGAG
ATCTCGTATC TGTCGGGCCT CAAGCCGATG ACCGGAGCGA AGATTTTCAC CGATCCGGGC
GGCGAACCCC TGGCCATACT CTGCAAGGCC ACCGGCCTGC CGCGCGGCGC GGTGCGCGCT
CTGTGGCGAG GGTTGCGCCG GCCCGAGACC GACGCGTCGG GCGCGCCGAC GCCGGGCCTG
GAACGGGTGC TGACCGCGTT CGACACCATC GCGGTGGACC GGGCCCAGAC CATGCTGCGC
TATTGGAACT GGTCGCTGTC CTCGGCCATG ACTCCGGCCC TGTTGAAGGC CATCCGCGAG
GGCGACGAAG CCGCCGTCGA TGAGTATTCG GTGCCGCAGC GCGCCGCCAT GCTGGCGCTG
TCTCGAGATT TTGGCAGATA G
 
Protein sequence
MNEHAAPFAS PEPEPAVAHR SRAALLKRLA DVVCLPASRI NAFERSMTAD LLVEMLRDAV 
TVEREKVARR LANLAEMPGV LVRMLLRDEL PVARALLENS PSLSDADLIS CLYNATQDHR
RLIALRRGVS EVVADALVDM DETAVTETLL KNELARFSHQ GLENIVAATR DNPQLIPLLL
KRSELRPSHA YVMFWWSDAD ARRTILQRFA VSREILQDAV GDVFPLASAE GWQDPLSRKA
LQFIERRQRN RAAIAKSPYD SLEAAIAAAQ NGMTRETAEE ISYLSGLKPM TGAKIFTDPG
GEPLAILCKA TGLPRGAVRA LWRGLRRPET DASGAPTPGL ERVLTAFDTI AVDRAQTMLR
YWNWSLSSAM TPALLKAIRE GDEAAVDEYS VPQRAAMLAL SRDFGR
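
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the method used to produce this record. It assumes the sequence is read in frame from its first base, and it uses the codon assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the start codons it permits, which does not matter here because the gene starts with ATG). The names `CODON_TABLE`, `clean`, `translate`, and `gc_fraction` are hypothetical helpers introduced for this example.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence above.
first_line = "ATGAACGAAC ACGCCGCGCC TTTCGCCAGT CCCGAGCCCG AGCCGGCCGT CGCGCACCGC"
last_line = "TCTCGAGATT TTGGCAGATA G"

print(translate(first_line))  # MNEHAAPFASPEPEPAVAHR
print(translate(last_line))   # SRDFGR
```

The two printed fragments match the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.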