Gene Caul_5388 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5388 
Symbol 
ID5897208 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010333 
Strand
Start bp98285 
End bp99973 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content68% 
IMG OID641550678 
Producttransposase IS66 
Protein accessionYP_001672164 
Protein GI167621656 
COG category[L] Replication, recombination and repair 
COG ID[COG3436] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGCTG ACCTCGCAGC CCTGCCGGAC GATATCGAAG CGCTGAAGGC GGCGCTTCTG 
GTCGCCCGGG CCGAGGTCGC GCAAGCGCAA GACGTGGCCG CGAGAGCTCA GGCCGAAGCC
TCCAGAGCCC AGGCCGAAGC CGCTGAAGCC AAGGCGCGCG TGTCTGACGA CCAAGCGCTG
ATCGCCCACC TGAAGCTCCA GATCCAGAAG CTCAATCGTG AGCGCTTCGG CCCTAGCTCG
GAACGCACGG CCCGTCTGCT TGATCAGCTG GAACTGCAGT TGGAGGAGCT GGAGGCTTCG
GCGACGGAAG ACGAGCTGGC CGCCGAGATG GCGGCGGCTC GGACCACGAC GGTGGCCGCC
TTCAGCCGCA AGCGGCCTTC GCGCCAGCCC TTCCCGGAAC ACCTGCCGCG TGAGCGGGTG
ATCGTGCCAG GTCCGACCGC CTGCGCCTGC TGTGGCGGGC TGCGCCTCTC GAAGCTGGGC
GAAGACGTTA CCGAAACGCT GGAGGTCGTG CCCCGGTCCT GGAAGGTCAT CGCGCACGTC
CGCGAGAAGT TTAGCTGCCG CGACTGTGAG GCCATCGGCC AGGCGCCGGC TCCGTTCCAT
GTGATCGCCA GGGGCTGGGC GGGTCCCAGC CTGCTGGCCA TGATCCTGTT CGAGAAGTTT
GGTCAGCATC AGCCGCTCAA TCGCCAGGCC GACCGCTATG CTCGCGAGGG CGTGCCGCTC
AGTCTGTCGA CCTTGGCCGA TCAGGTCGGG GCCTGCACGG CGGTGCTGGC GCCGCTGTTC
CAGCGGCTGG AGGCTCACGT GCTTGCCGCC GAACGATTGC ACGGCGACGA CACCACGGTT
CCGGTATTGG CCAAGGGCAA GACCGACACC GCCAGGCTCT GGGTCTATGT GCGCGACGAC
AAGCCGTTCG CGGGATCGGC GCCGCCGGGC GCGGTCTTCT ACTACTCGCG TGATCGGGGT
GGCGAGCATC CGCAAGCGCA CTTGTCAGGT TATGCCGGCC TGTTCCAGGC CGACGCCTAT
GGCGGTTACG GCAAGCTCTA TGAGCCAGGG CGAAACCCAG GTCCCATTCT TGAAGCAGCC
TGCTGGGCAC ACGCGCGTCG GCCGTTCTTC GTGCTGGCCG ACCTGGAGCA GAATGCGCGC
CGCAAGGCTC GCGGCGCGGC GCCGGCGGTG ATCTCGCCGA TCGCCCTGGA GATGGTCCAG
CGGATCGACG CGCTGTTCGA GATCGAGCGG GGGATCAGCG GCCAGGACGC AGATAGGCGC
CTAGCGGTGC GACAGGCGCT CAGCGCCCCG CTGGTCGCCG AGATGGAGAT CTGGATGCGC
GAGCAGCGCG CCAAGCTCTC ACGCGGTCAT GACTTGGCCC GGGCCTTCGA CTACATGCTC
AAGCGCTGGG CCGCGTTCAC GCGCTTCCTC GACGACGGCC GCGTCTGTCT GAGCAACAAT
GCCGCCGAGC GGGCGCTGCG CGGCGTGGCC ATGGGGCGTA AGTCCTGGCT GTTCTGTGGT
TCTGATCGCG GCGGTCAACG CGCGGCGGTG ATGTACAGCC TGATCGTCAC CGCCAAGCTG
AACGACATCG ACCCTCAAGC CTGGCTGGCC GACGTCCTGG CCCGCATCGC CGAGCATCCC
AGCCAGCAGC TCGATGAACT ACTGCCCTGG AACTGGCAGC CCCTCGCTAC CGCTGACCGC
GCCGCTTAG
 
Protein sequence
MDADLAALPD DIEALKAALL VARAEVAQAQ DVAARAQAEA SRAQAEAAEA KARVSDDQAL 
IAHLKLQIQK LNRERFGPSS ERTARLLDQL ELQLEELEAS ATEDELAAEM AAARTTTVAA
FSRKRPSRQP FPEHLPRERV IVPGPTACAC CGGLRLSKLG EDVTETLEVV PRSWKVIAHV
REKFSCRDCE AIGQAPAPFH VIARGWAGPS LLAMILFEKF GQHQPLNRQA DRYAREGVPL
SLSTLADQVG ACTAVLAPLF QRLEAHVLAA ERLHGDDTTV PVLAKGKTDT ARLWVYVRDD
KPFAGSAPPG AVFYYSRDRG GEHPQAHLSG YAGLFQADAY GGYGKLYEPG RNPGPILEAA
CWAHARRPFF VLADLEQNAR RKARGAAPAV ISPIALEMVQ RIDALFEIER GISGQDADRR
LAVRQALSAP LVAEMEIWMR EQRAKLSRGH DLARAFDYML KRWAAFTRFL DDGRVCLSNN
AAERALRGVA MGRKSWLFCG SDRGGQRAAV MYSLIVTAKL NDIDPQAWLA DVLARIAEHP
SQQLDELLPW NWQPLATADR AA