Gene Cpin_1001 details


Gene Information

Locus tag: Cpin_1001
Symbol:
ID: 8357115
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chitinophaga pinensis DSM 2588
Kingdom: Bacteria
Replicon accession: NC_013132
Strand:
Start bp: 1202871
End bp: 1203908
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 44%
IMG OID: 644963155
Product: protein of unknown function DUF900 hydrolase family protein
Protein accession: YP_003120700
Protein GI: 256420047
COG category: [S] Function unknown
COG ID: [COG4782] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
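
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; neither convention is stated in the record itself. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1202871
END_BP = 1203908
GENE_LENGTH_BP = 1038
PROTEIN_LENGTH_AA = 345


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)      # True
print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Under those assumptions the record is internally consistent: 1203908 - 1202871 + 1 = 1038 bp, and 1038 / 3 - 1 = 345 aa.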


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAGAT TTCTGGTTAT CCTATGCGTA ACACTCTCAC TGAACGTTCA TGCTCAGTTA 
TCATTCTACA ACAGTGATGA TTATTGGGCA AATTTTAGTT TTCAGTCAGA TAGTGTTAAG
ACGGCGGCGA CAGATACCTG TCTGGTTTTT GTGAGTAACA GGCACCTGTA CAAGGATAGT
TTACGCTTTG TAGATGAATA TGTGGATACG TCAGCGTTGA AATATTTCTT TCTGCAGAAG
CATGGTGGAC AATGGAATGT ATTTCAGACG CCTACGCTGG CGGATGCGAT GCATTTATTG
CCGGAAAAGC GGGATATTGT CGTTTATGCG GAGGGGATGG GGAAGATCTT TACGACAAAT
GTGGAAAGGG CATTGCTGAT GCGGTCGCAA TACCAGGTGA ATGTGATTAT GTTTGATTAT
GCCAGTATCA ACACCACTTA CCGGCCGGCG CGTAATTTCA GGTTTGCACG GGAAAATGCC
CGTTTATCGG CGCCGCATTA TTACCGGCTG CTCCGTGTTA TACAGCAGGC CCGGAGGGAA
AAAGAGGACT GGATGCAGCA GGTGAAGGTC TCTACGTTCT GTCATAGTAT GGGTAATATT
ATATTGATGG AGATGATGAA AGTGCAGGAT TATCAGCAAT TGAATAATGA ACCTTTTATT
GATAATGTAG TGATCAATGC GGCTTGTGTA CCCTCTAAAA AACACGCGGA ATGGGTAGAG
AACATACATT TTGCGAATAA GATCTATATC CATTATAATA AATCAGACTG GCAACTAAAA
GGGGCTCATT TACTGACGCT GGAGGCGCAG TTGGGGGAGA AGCTTAAAGG GAAGCTGGCG
AAAAATGCCA ATTATGTCAA TTTTCGGGAG CAGGTAGGCA GTCAGCATAG TTATTTCCTG
AATTTTCCCC AGAATGAGTA CCGTATGACG AATGAGATGA AGGATTACTT TGTACAGCTA
TTCAGCGGCA ATACGGCTGT GCTGGAAGAA TATAAGACGC TGGTGAAGAA GGAGGGCAAC
GGGACCAGTG TGAATTAA
 
Protein sequence
MKRFLVILCV TLSLNVHAQL SFYNSDDYWA NFSFQSDSVK TAATDTCLVF VSNRHLYKDS 
LRFVDEYVDT SALKYFFLQK HGGQWNVFQT PTLADAMHLL PEKRDIVVYA EGMGKIFTTN
VERALLMRSQ YQVNVIMFDY ASINTTYRPA RNFRFARENA RLSAPHYYRL LRVIQQARRE
KEDWMQQVKV STFCHSMGNI ILMEMMKVQD YQQLNNEPFI DNVVINAACV PSKKHAEWVE
NIHFANKIYI HYNKSDWQLK GAHLLTLEAQ LGEKLKGKLA KNANYVNFRE QVGSQHSYFL
NFPQNEYRMT NEMKDYFVQL FSGNTAVLEE YKTLVKKEGN GTSVN
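
The sequences above are printed in blocks of ten characters, so they need to be joined before any processing. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the protein sequence. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon table uses the standard amino acid assignments, which translation table 11 shares with the standard code; the two tables differ in which codons may act as start codons, and this sketch does not model that.

```python
# Standard amino acid assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    """
    seq = clean_sequence(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First row of the gene sequence, copied from this record.
first_row = "ATGAAGAGAT TTCTGGTTAT CCTATGCGTA ACACTCTCAC TGAACGTTCA TGCTCAGTTA"
print(translate(first_row))  # MKRFLVILCVTLSLNVHAQL
```

The 20 residues printed for the first row match the start of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the reported GC content.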