Gene Caul_5168 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5168 
Symbol 
ID5897418 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010335 
Strand
Start bp84190 
End bp85158 
Gene Length969 bp 
Protein Length322 aa 
Translation table11 
GC content64% 
IMG OID641555271 
Productplasmid replication initiator protein-like protein 
Protein accessionYP_001676602 
Protein GI167621817 
COG category[L] Replication, recombination and repair 
COG ID[COG5534] Plasmid replication initiator protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.697276 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTCC CCAAGGACCG CAACAACCAG TTCGACCTCT TCATCCCGTT GATGCGCGAT 
CTTCCGCTGA AGGATCAGCG CGAGACGATG GAGCGGCCGT TCTTCAGCCT GCAAAAGCGC
AAGCGGCTCA AGCCCATCGA GTACAAAAGC GCCGACGGCG AGGTCTCGGT GAAGGTCGAA
GCCGTGCCTG CCTACGGCAT GGCCACCATC TGGGACGGCG ACATTCTCAT CTGGGCCGCC
AGCGCCTTGA ACCGCCTCAA GGCCGAGGGG CGCAACGATG TGCCCCGCTC GTTGAAGGTC
ACCGCCTATG ACTTGCTGCG CTCGATCCAG CGCGACACCG GCGGCAAGGG CTATAACGAC
CTGAAGGCGG CTTTGGACCG CTTGGCGACC ACGACCATCT TCACCTCGAT CCGCGCCAAG
AAAGGCCGCG ATCGCCGCTT CAGCTGGCTC GATGGCTGGG ACGTCGAGGT CGATCCGATC
ACCGACAAGC CCATCGCCTT GAAGATCACG CTCTCGGACT GGGTCTGGGA GGGGATCATG
AACGAGAAGT CGGTGCTGAC CATGCACCCC GACTACTTCC AGATCTCCGG CGGGCTTGAA
AAGGCCATCT ACCGGATCGC GCGCAAGCAC GCCGGCGACC AGGACGACGG CTGGACGTGC
CGCGTCAGCG TGCTGCACGA GAAGACGGGC TCCGACAGCG AGCCCAAGGA ATTTAGCCGG
ATGCTGCGAA AGATCGTCGA GGTCAACGAG CTGCCGGAAT ACGACATGGC CTTCGTGACC
ACCGGCGACG GGAGCCAAGG TGTGCGCTTC ATCCGCCGTT CGGTCGTCGA ACGCGTGCAA
ATCCAGGCTG AACTAGAGGC CGAGGCCGCC GGCCTGGCGC GGCGGGAACG AGAAGATCGC
CGAGCCGACG AAGTCGATGG CCGACTCGAT CCCTGGGCCA AGCGCCGAGT CCCGAGCGCC
GAAGGCTAA
 
Protein sequence
MTVPKDRNNQ FDLFIPLMRD LPLKDQRETM ERPFFSLQKR KRLKPIEYKS ADGEVSVKVE 
AVPAYGMATI WDGDILIWAA SALNRLKAEG RNDVPRSLKV TAYDLLRSIQ RDTGGKGYND
LKAALDRLAT TTIFTSIRAK KGRDRRFSWL DGWDVEVDPI TDKPIALKIT LSDWVWEGIM
NEKSVLTMHP DYFQISGGLE KAIYRIARKH AGDQDDGWTC RVSVLHEKTG SDSEPKEFSR
MLRKIVEVNE LPEYDMAFVT TGDGSQGVRF IRRSVVERVQ IQAELEAEAA GLARREREDR
RADEVDGRLD PWAKRRVPSA EG