Gene Caul_5355 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5355 
Symbol 
ID5897124 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010333 
Strand
Start bp66798 
End bp68855 
Gene Length2058 bp 
Protein Length685 aa 
Translation table11 
GC content69% 
IMG OID641550647 
Producthypothetical protein 
Protein accessionYP_001672133 
Protein GI167621625 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.323781 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACA CCCAGACCGT CTCCCCAACC CGGATGGACG AGATCACCGT CCGCCTCGGC 
GACCTTGGCC TCGCGCCCGA GAACCTTCGC TTCCAGGAGC CGGCGGACGA CGGCGTGCCC
CAGCTCGCCG AAACCGTGCT GGCCGCCGGC GTCCTGATCC CGCCGATCGT GCGGGCGGGC
CTCAAGAGCG AGCAGGCCTT CATGACGCTG GACGGCCGCC GCCGCCGGTT CAGCTTGCTG
GTCCTGCGCG ACCGCGGCGA CATCGACGAC GACTATCCCG TCGTCTGCAA GCTGGCGGCG
AGCAAGGCCC AGCAGGCCGC CGCGATCATC CTGCCCAACG CCGAGGTCGC CCCGGTCCAC
ATCGCCGACA TCATCGGCGC CATCGGTAAG CTGCGCAAAG CCAAGATGGA CACCGCCGGC
ATCGCCCGCG CCCTGGGCTA CGCCGAGCTG GAGATCAAGC GCCTGGAGGC CCTGTCAGCT
GTCCACCCGA GCGTGCTGAA GGCCCTGCGC CTGGGCAAGC TGAACCTCAA GCAGGTCCGC
CTGTTCGCGC GGATGCCCGA CAAGAAGCAG CAGGGCGAGT TGGCGGAGAC CGCGCTCGAC
GGTCACTTCC ATGACTACCA GCTGCACCAG GTGATCAACG GCTCGCGCCT GACCATCGAG
GACGACCGCT TCGGCCTGGT CGGGATGGCC CGCTACACCG CCGCCGGCGG GCGGGTCGAG
TCCGACCTGT TCGCCGAACT GGCCGATGTC CTTCTCGATC CCGGCAAGCT GCAGGATCTG
TGGCGGGAGC GCGCCGCCCC CTTCGTCGAG GGGTTCAAGC AGCTTGGGCT CGCCGTCTAC
ATCGGGCGAG ACGCCGGCTT CCGGGCCCCG GAAGGCTTCG AGACCCTGCC CTACGTCTAC
CCCGGCGACC TGACCGATGA GACCAAGGCG GTGCTAGCGG CCGCGCGGCA GCGGGTGGCC
CAGGCCGCGC GTGACCTCGG CGGTGTCGAT CTCGCCGCCG ACGATGCGGC GCTGACGATC
TTCCCCCTGC TGCAGGCGAA GATGGAGGTG GCCTCGGCTC CGCTGAAGCG GCTGGCGCTC
GGCGCCGTCA TCTTGTCGCC GGACGGGGCG ACGGGGATCT CGGCCGAGTT CTTCGCCGCG
CCGGTGTCGG AGGAGCTGCT GGATGGGGCT GGGGATGACT TGGCTGGGGA GAATGGGGCC
GACGAGGACG ATGCGAGCGG CGGCCAGGGC AACGGCGCGC GCTACGGTCG CTCCGCCAGC
GACGTGGAAG TGCCCAAGGC CGACGTTGAT GTCGAGGGCT CCAGCCACGT CCTGCACGAG
ACCCGCACCG ACGTGGCTAC GCGCGGGCTG ATCCGTGATC TTGCCGACAA TCCGGCCGCC
GCCCTGACAG CCTTGGTCGC CCAGCTGTTC AAGCAGCTGG CGCTGCAAGG CGGGCCTGGC
CATGAGGAGT CGGCGCTGGC CATCAATGCC ACCGGCTACC GCCGTGGCCA GACGCCGGCG
ATCGGCGCTC TGGACGGTGA TATTCGAGCC CGGCTGGAGG CGCGGCGTGT GGTCTACAAG
GCCTCGGGAC TTCGTCCCAT CGCCTGGGTC GACGGCCTCG CCCACGGCGA CAAGATGGCC
TTGCTGGCGG AACTTACCGC CATCACCCTG AACCTTCGGG AAGCCCGCAC CAGCAACATC
CGCGACTCCG CACGGGCCGA AGCCATCGAG CTGGCGCAAC TTTGCGCCGC CGACATCTCG
GCGCACTGGA CGCCTGACCC CGACTACCTG GCTGTCCACT CCAAGAGGCA GCTGCTGGTG
CTGCTGGACG AGATGCAGTT AGACGATCCT CGGGCCAAGA CCCTAAAGAA GGACGAGCTA
GTCGTCCTGG TGGCTGACGC CGCGGCCGAG CGCCAGTGGG CGCCGCAGGT GCTGTCCTGG
GAGAGCACCA CGGTTGAGAC GCAGCCGCCG GCCGATGAGG ACCAGGACCA GGACGACGGT
GATGAGGCCC TGGCTGACGA CCTGACACCA GGCCCGGCCG CACCTTCCCC GGAGGTCTCG
GTTCAACACG CGGCCTGA
 
Protein sequence
MTDTQTVSPT RMDEITVRLG DLGLAPENLR FQEPADDGVP QLAETVLAAG VLIPPIVRAG 
LKSEQAFMTL DGRRRRFSLL VLRDRGDIDD DYPVVCKLAA SKAQQAAAII LPNAEVAPVH
IADIIGAIGK LRKAKMDTAG IARALGYAEL EIKRLEALSA VHPSVLKALR LGKLNLKQVR
LFARMPDKKQ QGELAETALD GHFHDYQLHQ VINGSRLTIE DDRFGLVGMA RYTAAGGRVE
SDLFAELADV LLDPGKLQDL WRERAAPFVE GFKQLGLAVY IGRDAGFRAP EGFETLPYVY
PGDLTDETKA VLAAARQRVA QAARDLGGVD LAADDAALTI FPLLQAKMEV ASAPLKRLAL
GAVILSPDGA TGISAEFFAA PVSEELLDGA GDDLAGENGA DEDDASGGQG NGARYGRSAS
DVEVPKADVD VEGSSHVLHE TRTDVATRGL IRDLADNPAA ALTALVAQLF KQLALQGGPG
HEESALAINA TGYRRGQTPA IGALDGDIRA RLEARRVVYK ASGLRPIAWV DGLAHGDKMA
LLAELTAITL NLREARTSNI RDSARAEAIE LAQLCAADIS AHWTPDPDYL AVHSKRQLLV
LLDEMQLDDP RAKTLKKDEL VVLVADAAAE RQWAPQVLSW ESTTVETQPP ADEDQDQDDG
DEALADDLTP GPAAPSPEVS VQHAA