Gene Cpin_3056 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_3056 
Symbol 
ID8359221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp3774990 
End bp3776621 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content45% 
IMG OID644965234 
ProductTPR repeat-containing protein 
Protein accessionYP_003122730 
Protein GI256422077 
COG category[R] General function prediction only 
COG ID[COG4785] Lipoprotein NlpI, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATATCA TCTCCCTGTC TGTCAAACGC TGCCGTTTTC ATTTTGGTTC GATTTATGTA 
GTTAGTGGAC GTTCGATTTT GTTAACCCAA GCTATACGTC GTATGAAATA CATACCGCTC
AAACTAGTAA GAAATATATC AGGACTTTTA CTGCTGACTA CAACAATGGC TTACGGCCAG
ACAGATTCAT TATTGGTGCA TAGCAGATCA ACTGCGTATT ATGCTGAGGG AAAGAGCTAC
CTCAAAGAGA AGGATTATAA TGCCGCCATT CAGAGTTTTA CAGCGGCGAT CGCCATTCAT
CCCACTGACT CCGCATATGC AAACCTTGGA TTTGCCTATA TCCGGAAGGA GAATGATAAA
AATGCATTTG TTGCCCTGAA TAAGGCGCTT GATCTCAATG GAAACTATGC CTGGGCTTAT
TGTCTCCGGG GATATCTTTA TACGAAAATT AACGTCCCGG AATTATCTTT TAATGACTTC
TCAAGGGCCA TTGCACTAAA TGCAAAAGGA GGGGATTTGT CCGGCGCTCA GAATGCGGTA
TCAGATAAAT CGGTGATCCG GGATTATACG AAGAAAATAG GAAAGGATCC TAAGGATGAT
AGTGCTTATT TACAACGGGC GCGCGCCTAT GAGAGCCGGG AGAAAAATAA ACAGGCGGTA
AAGGATTATA AGAAAGCAAT CGCACTCGAT CCGGATAATA CAGAAGCCTA TTACGGTCTG
CAAAATCTTT ATCTGCAGGG GAAAGGACCT AAACAAAGTT CAGCTCCTGC TGATGATATC
TCGGATACGA CGGAACAGTA TATAGAAGAA CAACCGTTCA CAGAATTCTA TGCTACCCAG
GGTACAACTT ATTCAACGCT TGAAGAAAAG TATGCATTGA CGATTCGTGA TTATACAAAA
GTGATTACGC TGTTTCCCGG GAACAGCGCT GCTTTTCGGG ATCGCGGCTA CCTGTATGCA
AAACTTAATA AGACAGACTC CGCTATTGCT GACTTTACAA GCGCAATCAC ACTCGATCCG
CAGTCATCGC TTGCATTGGG TTACCGGGGT GCATTGTATA TAGAAACAAA GCAACTGGAA
TCAGCAATCG CTGATTTGTC AGCCGCGATT AAAATCGACC CGGATGCTTT GCAGCACTAT
TATAATCGTG GGCTGGCCTA CTACCAGTGG GGCGCATATG AACCGGCTAT AGCAGATTTT
ACTACCTTAA TCACCAAAGG TCCGCCTAAT GCAGTTGCCT ATCGCTACCG GGGGAATCTT
TATACATACG TCAATAAGCC TGCATTAGCC ATTGCTGATA TCAGCAAGGC GATCGATCTG
GCGCCAAAAG AAGCAGAAAG TTATGCGGTT CGCGGACTGG CCTATGCTTT ACAAGCAGAT
TATAAACAGG CCGTCCAGGA TTTTAGCACT TCCATCAAAC TGGATCCTGG CAGTAAGACG
ATATATGTTA ATCGCGCATT GGCGTATAAG TACCTGAACA ATTATAAGGC GGCTATTAAA
GATTATACCC AGGCGATTGA GCTGGACCCG AATGATGTGG ACGTATATAA GGAACGGGGC
AAGGTGTATG AGCAGATGGG AAAAAAAGAC CTGGCAGCCG CTGATTTTAA GAAGGCAGGA
GCGGTGGAAT GA
 
Protein sequence
MDIISLSVKR CRFHFGSIYV VSGRSILLTQ AIRRMKYIPL KLVRNISGLL LLTTTMAYGQ 
TDSLLVHSRS TAYYAEGKSY LKEKDYNAAI QSFTAAIAIH PTDSAYANLG FAYIRKENDK
NAFVALNKAL DLNGNYAWAY CLRGYLYTKI NVPELSFNDF SRAIALNAKG GDLSGAQNAV
SDKSVIRDYT KKIGKDPKDD SAYLQRARAY ESREKNKQAV KDYKKAIALD PDNTEAYYGL
QNLYLQGKGP KQSSAPADDI SDTTEQYIEE QPFTEFYATQ GTTYSTLEEK YALTIRDYTK
VITLFPGNSA AFRDRGYLYA KLNKTDSAIA DFTSAITLDP QSSLALGYRG ALYIETKQLE
SAIADLSAAI KIDPDALQHY YNRGLAYYQW GAYEPAIADF TTLITKGPPN AVAYRYRGNL
YTYVNKPALA IADISKAIDL APKEAESYAV RGLAYALQAD YKQAVQDFST SIKLDPGSKT
IYVNRALAYK YLNNYKAAIK DYTQAIELDP NDVDVYKERG KVYEQMGKKD LAAADFKKAG
AVE