Gene Caul_0497 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0497 
Symbol 
ID5897952 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp541031 
End bp542179 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content69% 
IMG OID641560980 
Producthypothetical protein 
Protein accessionYP_001682129 
Protein GI167644466 
COG category[S] Function unknown 
COG ID[COG2828] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGTCC AGACGACCTT GCCCTGCGTG TTGATGCGCG GAGGCACCAG CAAGGGGCCC 
TATCTGCACG CCCGTGACCT GCCGCCGCCA GGCGAAGCGC GCGACGCTCT GCTGATCCGC
CTGATGGGCA GCCCGGATTT GCTGCAGATT GACGGGCTGG GCGGCTCGCG GCCGATCACC
TCCAAGATCG CCATCGTGAG TCCCTCCGAG CGGGAGGACG CCGATGTCGA TTACCTGTTC
GCCCAGGTCG ATATCGAACG GGCGCTGGTT GGCTATCAGG GCAACTGCGG CAACATCTCA
TCAGGCGTCG GCCCGTTCGC CATCGACGAG GGCCTGGTCC CCGCCGCTGA GCCCGTGACC
CGGGTGCGGA TCTTCAACGT CAACACCGGC AAGGTCTTCG TGGCCCATGT TCCGGTCGAG
GGCGGTCGGG CCAGGGTCGA CGGCGACTTC GCCATTCCAG GCGTGCCGGG CACGGGCGCC
GAGATCGTTC TCGACTACAC CAACACCGTC GGCGCCAAGA CCGGACGGCT GCTGCCCACC
GGCTCGCCGC GCGACGTCAT CGAACTCGAG GACGGCGCCC GGCTGCGGGC CACGATCTGC
GACATCGGCA ATCCGACCGT CTGGCTGTTC GCCGACGACC TGGGTGTGGA TGGCTCAATC
CTGCCGGTCG ACATCGACGC CCATCCGACG CTGCTCGACC GCTGCGTCGA GATTCGCGGA
AAGGCGGCGC AGATGGCGGG GATGTGCGAC GACTGGCGCA AGGCCGAGAC CCAGTCACCG
GGCCTGCCGA CCCTGGGATT CGTGGCGGCG CCGGCCGACT ATGTCGCCTC GAACGGCGAG
GTCGTCGCCG AGTCCGACGT CGATCTTCGC GCCCGTCTGA TCTTCATGAA CCGCGCCCAC
GAAAGCATGG CCGGCACCGC CTCGGTCAGC CTGGCCGCCG CCTCGCGGGT TCCGGGTTCG
GTGCCCCACG AGGTCGCCGT GAATCGCGAC GCCGACCAGC TTCTCATCGG CCACCCGCTG
GGCAGCATGG CCGTCAAGGT GTCGTCCCGT CCGGGCGGGG CGGACGGCGT CGTGTTCGAC
ACCGTTGGCT TCAGCCGGAC CGCGCGACGG CTGATGTCCG GCGTCGCCTA TCTCCCGGCG
ATCAGGTGA
 
Protein sequence
MAVQTTLPCV LMRGGTSKGP YLHARDLPPP GEARDALLIR LMGSPDLLQI DGLGGSRPIT 
SKIAIVSPSE REDADVDYLF AQVDIERALV GYQGNCGNIS SGVGPFAIDE GLVPAAEPVT
RVRIFNVNTG KVFVAHVPVE GGRARVDGDF AIPGVPGTGA EIVLDYTNTV GAKTGRLLPT
GSPRDVIELE DGARLRATIC DIGNPTVWLF ADDLGVDGSI LPVDIDAHPT LLDRCVEIRG
KAAQMAGMCD DWRKAETQSP GLPTLGFVAA PADYVASNGE VVAESDVDLR ARLIFMNRAH
ESMAGTASVS LAAASRVPGS VPHEVAVNRD ADQLLIGHPL GSMAVKVSSR PGGADGVVFD
TVGFSRTARR LMSGVAYLPA IR