Gene Caul_3808 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3808 
Symbol 
ID5901270 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4130621 
End bp4132705 
Gene Length2085 bp 
Protein Length694 aa 
Translation table11 
GC content65% 
IMG OID641564330 
Productnuclease 
Protein accessionYP_001685432 
Protein GI167647769 
COG category 
COG ID 
TIGRFAM ID[TIGR00180] ParB-like partition proteins 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.310263 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACCG CTGTAATCGA AACGACCGCC CCGATTTCCA ACGTTCCCGC CGCCGTCAAC 
GGCGCGGAAG TTCTCATCCC TCTCAACAAG CTCAAGAAGT CCCACCGGAA CGCCCGGAAG
ACGCCGCATA GCGAGGCGTC TATCGAGGCC AAGGCGGCGA GCATCCACGC CAAGGGCATC
CTGCAAAATC TGGTGGTGGA GCCGGAGTTT GACGCCGAGG GCGCGGAAAC CGGCTTCTAC
CTTGTCACTA TCGGCGAAGG CCGGAGGCTG GCGCAGTTGC TGCGCGCCAA GCGCAAGCAG
ATCAAGAAGA CCGAGCCGAT CCGCTGCGTC ATCGACACGC AGAACGATCC AGCCGAGATC
AGCTTGGACG AGAACGTCAC CCGCGAAGAC CTTCACCCCG CCGACCAGTT CGAGCGCTTC
CGCGAGCTTG CGGAAAACAA GGGATGGGGA GCCGAGGAAA TCGCCGCCCG GTTTGGCGTC
ACCCCGCATG TGGTGCGCCA GCGGCTGCGC TTGGGCGCGG TCAGTCCGAA GCTGATGCAA
GTCTATCGCG ATGAAGGGCT GACGCTGGAC CAGTTGATGG CCTTCGCCAT CGTGGAGGAC
CATGCGCGGC AGGAGCAGGT TTACGAGAAC CTCTCCTATA ATCGCGATCC GTCGATCATC
CGCCGTGACC TCACCCGCTC GCACATCGCG GCAGCGGACC GGCGCGCGAT CTTCGTCGGG
CCGGAAGCGT ATACCGAGGC GGGCGGCGTC ATCCTGCGCG ATCTGTTCAC CGAGGATCGC
GGCGGCTTCT TTGAGGATGT CGTGCTGCTG GACCGGCTTG TCAGCGACAA GCTGGAAAGC
ATCGCCGTCG AGGTTCAGGT GGAGGGCTGG AAATGGGCGA GCGCACACAT CGACTATCCC
CACGCCCACG GCCTGCGCCG CAACTACCCG CAACCGGTGG CGCTGTCGGC GGAAGACGAG
GCGGCGCGCG AAGCCGCGCA GGCGGAATAT GACGCGCTGA CCGAGCAGTG GGATAGCGCC
GACAACCTTC TTCATGAGGT GGACGAGCGT TTCGGGGAGC TAGAAGCCGA GATCGAACGC
ATCGATGCCC TGCGTCATGC CTACGATGCC GACGACATCG CGCGTGGCGG CGTGATCGTG
GTGCTTTCCC ATGACGGGAC GGCGCGGATC GAGCGCGGCT TCATCCGGGC CGAGGACGAG
AAGCCTGAAC CGGAGGTGGA GGCACAAGCC AATACGGGAG GCGAAGACTA TACCGTCACC
GAAGACGGCG AGATCATCGA GGGTGGCGAC GAGGACCGGG TTTCGGCCTT GGAGACGGAA
GAAGAAGACG CCGACGATGG CAAGCCGCTG TCGGACCTTC TCGTCCGCGA CCTGACCGCG
CATCGCACCC TTGGCCTGCG TCTTGCCCTT GGCGAGCAGC CGGACATGGC GTTGATCGCC
GTCACCCACG CCCTCGCGGC GCAGACCTTC TATCGCGGCG TGGAAGCCCA TTGCCTCGAT
ATCCGGCCGA GCAGCGCGCA TTTGGGCGGG CATGCGGACG GCATCGAGGA CACGGCGGCG
GCGAAGCTGC TGGCGGATCG TCATGACGGA TGGGCGGGAG ACATGCCGCG CGACGTGGCG
GACCTGTGGG ACTTCGTCGC CGGTCTGGAC CATGCGAGCG TCATGGCGCT GTTCGCGCAT
TGCGCCTCGC TGACGGTCAA CGCCGTGAAG CAGCCTTGGG AGCGCAAGGC CCGCGCCCAT
GAAGCCGCCG ACAAGCTGGC GACGGCGGTC TCCCTCGACA TGACCGCTCA CTGGACGTCC
ACGGTGCGGA CCTATCTCGG TCGCGTCACC AAGGCGCATA ATCTCGCCGC CGTGCGGGAG
GCCGTTAGCG ACGAGGCGGC GGAACGCCTG TCGGGCTTGA AGAAGCAGCC TATGGCGGAA
GCCGCCGAAC AGCTTCTCGC CGGGACCGGC TGGCTTCCGA CCTTGATGCG GACGGCGGAA
CCCGCGTGGC CCGCAATCGA GCAGCCCGAC GTGCAGGAGG TGGTTGAAAC GGAGCATGCC
GCGAGCGCGG ATGATGGTGA AAGCTACGCC ATCGCCGCCG AGTGA
 
Protein sequence
MTTAVIETTA PISNVPAAVN GAEVLIPLNK LKKSHRNARK TPHSEASIEA KAASIHAKGI 
LQNLVVEPEF DAEGAETGFY LVTIGEGRRL AQLLRAKRKQ IKKTEPIRCV IDTQNDPAEI
SLDENVTRED LHPADQFERF RELAENKGWG AEEIAARFGV TPHVVRQRLR LGAVSPKLMQ
VYRDEGLTLD QLMAFAIVED HARQEQVYEN LSYNRDPSII RRDLTRSHIA AADRRAIFVG
PEAYTEAGGV ILRDLFTEDR GGFFEDVVLL DRLVSDKLES IAVEVQVEGW KWASAHIDYP
HAHGLRRNYP QPVALSAEDE AAREAAQAEY DALTEQWDSA DNLLHEVDER FGELEAEIER
IDALRHAYDA DDIARGGVIV VLSHDGTARI ERGFIRAEDE KPEPEVEAQA NTGGEDYTVT
EDGEIIEGGD EDRVSALETE EEDADDGKPL SDLLVRDLTA HRTLGLRLAL GEQPDMALIA
VTHALAAQTF YRGVEAHCLD IRPSSAHLGG HADGIEDTAA AKLLADRHDG WAGDMPRDVA
DLWDFVAGLD HASVMALFAH CASLTVNAVK QPWERKARAH EAADKLATAV SLDMTAHWTS
TVRTYLGRVT KAHNLAAVRE AVSDEAAERL SGLKKQPMAE AAEQLLAGTG WLPTLMRTAE
PAWPAIEQPD VQEVVETEHA ASADDGESYA IAAE