Gene Cpin_2421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_2421 
Symbol 
ID8358576 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp2976940 
End bp2979849 
Gene Length2910 bp 
Protein Length969 aa 
Translation table11 
GC content47% 
IMG OID644964607 
Productglycosyl hydrolase 38 domain protein 
Protein accessionYP_003122113 
Protein GI256421460 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0383] Alpha-mannosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.433161 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATATAC ATATTATGAA TAGGCTAAAA CGATGGTATA CAGACAAATA CTGTCTGCTG 
CTGCTGGCCG TAATGACCAT AGATCTGGCC CATGCACAGC AAACGCCTGC TTTTACGTCT
ATTGCCCTTC AACCGACCGT AGCTTATCTG AAACACCAGG GCCGTGCTGC CAGGATGGTC
AGACTGCTCT TTCATGGAGG CAATAGTTAT GCGCCCGCCA TGGCTTATAT GACTTTTAAC
GGCCTGTATG ATAGTATCCC GCTTGTGGCG ACTGAAAAAG GCATTGATAT ATTTGAACTG
CCTTTACCCG GTGGACCTGT ACACCACGAA ACACAGCTCT ATGTAAAAGT GAAATCCGGT
GGAAGGGAAT ATACCGCCCG TTGTATGGTA GCCCCTGCGC CGGAATGGAA AGTATACCTG
TTGCCGCATT CTCATGTGGA TATCGGGTAT ACCAATGTGC AGGAGAAAGT CATGCAATTG
CATATGAACA ATATTGACGA GGCCATCAAG ATTGCGGAAC GTACACAAAA CTACCCGCCG
GATGCCCGTT ATAAGTGGAA TACAGAGGCT TTCTGGGTAG TGGATCACTA CCTGGCTGCT
GCCGATGAAG CAAAGAAAAA AGCCTTCTGG GAAGCGGTTA AAAAGGGTTG GATCAATATT
GATGGCGCCT ACGCGAATAT CAATACCAGC GTGACAGATT CCCGTCAGCT GATGCAGATG
TTCTATACAG CGGTAAAGAC AGCCAAAGAC CACGGTCTTG ATATTAATAC CATGTTCCAG
GGCGACGTAC CCGGGGCTTC CTGGGGACTG GCCAGTCAGT CGGGTATAAC AGGGATCCGT
TATTTCTGGG GCGCGCCCAA TGCGGATGAT CGTATCGGTC AGTCGCCTGC ATGGAGAGAC
AGACCTTTTT ACTGGCAATC ACCCGGTCAG CAGAAAATGC TGTACTGGCA AAGCGAACCT
TATTCCATCG GTTACAGATT AAAAGGTAGC AAGATCCCTA ATTTCTTCAC GGTAGAAGAT
CCTGTACCTT ACTATACCGG TCATCCTTCT GAGAACTTCC TGAATCCTTA TCTGTTCGAT
TACCTGGCAG CACTGGACAA AAAGAAGTTC CCCTACAACA TGACTATCAT GACATGGGCG
ATGAGTGATA ATGCACCGAT TGATCCGGAA CTGCCGGAAG CCGTGAAAGC ATGGAATGAA
CGTTATGCAT CACCCAGATT GGTTATTACT TCCGTAAAAC AATTCTTCAA CGATTTTGAA
GCTGCCTATG CAGACAGAAT ACCGGTAGTA TCCGGTGATT ATACAGAGTT CTGGACAGAT
GGTATTGCTT CTGCTGCCAG AGAAACAGGG TACAACAGGA ATGCTTCAGA AACACTGCAA
CAGGCAGATG CTGTATGGGC ATTACGTGGC AAAGCAGACT ATCCGGCGAC AGCCATAGAT
ACGATATGGA ATAACATCCT GCTGTTCAAT GAACATACCT GGGGCGCTTA TAACAGTATC
TCCAATCCGG AAGATCCCAA GGTGATTTCA CAATGGGGAT ATAAGCAGTC ATTCGCATTA
AAAGGACATG CGCAGTCTGC CGCTATGTTA ACATCCGCCA CCGGCGGAGA AGCCATTGCT
AATGCTGTTG ATGTATACAA TACCATTGGC GAAAACCGTA CAGAATTAGT ACGGATACCT
GCGGCACAAA GTACAGCCGG AGACCTGGTG AAAGATGTCA ATGGAAAGAA AGTGCCATCG
CAACGTCTCA GCACCGGCGA ACTGGCCATA CTGGTACAAC ACATTGCACC ATATGCTAAA
CAACGCTTTA CCATACAGGC AGGAAAGGCT TATAGCAACA CGAAATCAGT TGTCAGCAAT
ACCACCCTGC AAAATGAGCT GTATAAGATA TCACTCAATG CGCAAACAGG TAATATCGAC
AAACTGGAAC GTAGTGGTAT TCCCCATAAT CTCGCGGATT CCGGTGGATT GAACCGTTAC
TCTTATCTGC CTGGTGATTC ATTGGAACAT ATTGCATATG CAGGTCCGGC GACAATCACC
GTAAAAGAAA AAGGTCCGCT GGTAGTGAGT GTAATTGTGA CAGCGGTAGC TCCCGGTGCT
AATGAACTCA CTACTGAAAT AATGCTGGTA GCAGGTGAAG ACCGTGTACA GATCATCAAT
AACATCGATA AAAAAGCGAT TACAAAGAAA GAAGGCGTAC ACTTCGGATT CCCTTTCAAT
GTAGCAGATG CGCAGGTCAG ATATAGTATT CCCTGGGGCA GTATCAATGT TGAAGCAGAC
CAGTTACAAT ATGCCAACCG CAACTGGTAT ACCGTACAAC ATTGGGTGGA CGTATCCAAT
AAAGACTATG GTGTCACCTG GTCAACACCA GATGCGCCAT TGTTTGAAGT AGGGGCACAG
ACAACGGCAG GTCTTACCGG TGGATTACAT GATTCTCCTA AATGGATCAG TTTTACAGAA
CAGCATCCTG CTATCTATTC CTGGATAATG AACAACCTGT GGCATACCAA TTTCCGGCAC
GCACAGGAAG GACCGACTAC TTTTCGTTAT TACCTGACGG TACACCATAG CTATGATGCC
TATGCTGCGA ATCAACAGGG ACTGGCTAAT CACCGTCCGT TGATAGCTGC TCCGGCCAGC
GGTCCGGCAA CAGAGTCTTT GCCTTTTACC ATCAACTCCA AAGCAGTCTA CGTAGAGAGT
CTGAAACCTG CCGCTGATGG CAAAGGGGTG ATCGTATGGC TGGTGAATAC AGCGCCGGTT
GAAACAGGCG TTTCCTTCAA TACGAAAAGT AATGCTGCAA AACTGCAGAT CACAGCTACC
AATATGCTGG AAGAATATAA ACAGTCATTA GATAACAACA TCATCCTGCC TGCAAAAGGA
GTTATGATGG TACGTGTGGA AAACAAATAA
 
Protein sequence
MYIHIMNRLK RWYTDKYCLL LLAVMTIDLA HAQQTPAFTS IALQPTVAYL KHQGRAARMV 
RLLFHGGNSY APAMAYMTFN GLYDSIPLVA TEKGIDIFEL PLPGGPVHHE TQLYVKVKSG
GREYTARCMV APAPEWKVYL LPHSHVDIGY TNVQEKVMQL HMNNIDEAIK IAERTQNYPP
DARYKWNTEA FWVVDHYLAA ADEAKKKAFW EAVKKGWINI DGAYANINTS VTDSRQLMQM
FYTAVKTAKD HGLDINTMFQ GDVPGASWGL ASQSGITGIR YFWGAPNADD RIGQSPAWRD
RPFYWQSPGQ QKMLYWQSEP YSIGYRLKGS KIPNFFTVED PVPYYTGHPS ENFLNPYLFD
YLAALDKKKF PYNMTIMTWA MSDNAPIDPE LPEAVKAWNE RYASPRLVIT SVKQFFNDFE
AAYADRIPVV SGDYTEFWTD GIASAARETG YNRNASETLQ QADAVWALRG KADYPATAID
TIWNNILLFN EHTWGAYNSI SNPEDPKVIS QWGYKQSFAL KGHAQSAAML TSATGGEAIA
NAVDVYNTIG ENRTELVRIP AAQSTAGDLV KDVNGKKVPS QRLSTGELAI LVQHIAPYAK
QRFTIQAGKA YSNTKSVVSN TTLQNELYKI SLNAQTGNID KLERSGIPHN LADSGGLNRY
SYLPGDSLEH IAYAGPATIT VKEKGPLVVS VIVTAVAPGA NELTTEIMLV AGEDRVQIIN
NIDKKAITKK EGVHFGFPFN VADAQVRYSI PWGSINVEAD QLQYANRNWY TVQHWVDVSN
KDYGVTWSTP DAPLFEVGAQ TTAGLTGGLH DSPKWISFTE QHPAIYSWIM NNLWHTNFRH
AQEGPTTFRY YLTVHHSYDA YAANQQGLAN HRPLIAAPAS GPATESLPFT INSKAVYVES
LKPAADGKGV IVWLVNTAPV ETGVSFNTKS NAAKLQITAT NMLEEYKQSL DNNIILPAKG
VMMVRVENK