Gene Cpin_5072 details


Gene Information

Locus tag: Cpin_5072
Symbol:
ID: 8361248
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chitinophaga pinensis DSM 2588
Kingdom: Bacteria
Replicon accession: NC_013132
Strand:
Start bp: 6317091
End bp: 6318185
Gene Length: 1095 bp
Protein Length: 364 aa
Translation table: 11
GC content: 46%
IMG OID: 644967220
Product: HEAT domain containing protein
Protein accession: YP_003124705
Protein GI: 256424052
COG category: [L] Replication, recombination and repair
COG ID: [COG4335] DNA alkylation repair enzyme
TIGRFAM ID:
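
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS length counts the terminal stop codon (the listed gene sequence ends in TAG). The function names are hypothetical and not part of the source database.

```python
# Values copied from the record above.
START_BP = 6317091
END_BP = 6318185


def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count, assuming the last codon of the CDS is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length)                     # 1095
print(protein_length_aa(length))  # 364
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.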


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.946755
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.336165
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTTTGC TGAAAGATCT CTACTCCCCT GCCTTTTACG ACGGATTGGC CAATATACTG 
GTAAAAACGA TTCCTGCATT CAATAAACAA AAATTCATCC AACGGATCTA CCAGCCAGGT
TTTCAAGAAA AAGAGCTGAA AGAGCGGATG AAACACACCA CGGAGGTATT ACATGAGTTT
CTGCCTGCTG ACTACAAAAA GGCGGTCCCA TTGATCAAAG ACAGTATCGA CGCGCTCAGA
AAGGCGGGTT ATGGAGAGGC GCTGGAATTT ATCTTCTTTC CGGATTATCT GGCTACTTAC
GGACTGGAAC ACTACGATAT ATCCGTAAAA GCCCTTGAGT TTGTGACACA GTTTATTACC
TGCGAATTTG CGGTAAGGCC GTTTCTGATT AAATATGGCG ACAAAATGAT GCAACAGATG
CAGACCTGGT CTTCGCATAA AAACGCCAAA GTACGCAGAT TGTCCACAGA GGGCTGTCGT
CCCCGACTTC CCTGGGCAAT AGCAGTACCA TTCCTGAAGA AAGATCCGTC TTCCATACTT
CCTATCCTGG AAAACCTGAA ACAGGATCCT TCCGAGTCCG TAAGACGTAG CGTGGCCAAC
AACCTCAATG ACATTGCCAA GGACCATCCG CGTCTGGTCA TTGCCATCGC TTCCAAATGG
AAGGGGCTTG GTAAAGAAAC AGCTGCCATT ATTAAACATG GCAGCCGTAC CCTGCTCAAA
CAGGGACATA AAGAAATCCT GGCTCATTAT GGCCTGGAAA GCGTGCATGT TGCCTTCAGC
AACTTCAAAG TACTGACTCC GAAAGTGAAA ACCGGCGATA GTCTGGCATT CTCGTTTACC
GTCAGGAATA AAGATGCGCA GGCACAGACC ATCCGGCTGG AATATGGCAT TTATTACCTG
AAACAGAATG GTACTTTATC TAAAAAGGTG TTCAAGATCA GTGAAAAGGC CTATAAACCG
GGAGTGCAGG TCGAAATTAT ACGTAAACAG TCCTTCCGGC TGATCACAAC ACGGGTATTC
TACCCCGGCA AACACCAGCT TTCTATCATC ATCAACGGAG AAGAACAACC CGCCAGAAGC
TTTGAGCTGA TCTAG
 
Protein sequence
MSLLKDLYSP AFYDGLANIL VKTIPAFNKQ KFIQRIYQPG FQEKELKERM KHTTEVLHEF 
LPADYKKAVP LIKDSIDALR KAGYGEALEF IFFPDYLATY GLEHYDISVK ALEFVTQFIT
CEFAVRPFLI KYGDKMMQQM QTWSSHKNAK VRRLSTEGCR PRLPWAIAVP FLKKDPSSIL
PILENLKQDP SESVRRSVAN NLNDIAKDHP RLVIAIASKW KGLGKETAAI IKHGSRTLLK
QGHKEILAHY GLESVHVAFS NFKVLTPKVK TGDSLAFSFT VRNKDAQAQT IRLEYGIYYL
KQNGTLSKKV FKISEKAYKP GVQVEIIRKQ SFRLITTRVF YPGKHQLSII INGEEQPARS
FELI
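
The relationship between the two sequences above can be illustrated with a short translation sketch. It assumes the standard codon assignments, which translation table 11 shares with table 1 for elongation; table 11's alternative start codons are not handled, which does not matter here because the gene starts with ATG. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the GC helper is only a plain G+C fraction, which may or may not match how the database derives its GC content field.

```python
# Standard genetic code in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks and the final 15 bases of the gene sequence.
print(translate("ATGAGTTTGC TGAAAGATCT CTACTCCCCT"))  # MSLLKDLYSP
print(translate("TTTGAGCTGA TCTAG"))                  # FELI
```

The two printed fragments match the first ten and last four residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_percent` would allow the whole record to be checked the same way.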