Gene Cpin_4945 details


Gene Information

Locus tag: Cpin_4945
Symbol:
ID: 8361121
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chitinophaga pinensis DSM 2588
Kingdom: Bacteria
Replicon accession: NC_013132
Strand:
Start bp: 6182401
End bp: 6184383
Gene Length: 1983 bp
Protein Length: 660 aa
Translation table: 11
GC content: 47%
IMG OID: 644967094
Product: alpha-L-arabinofuranosidase domain protein
Protein accession: YP_003124579
Protein GI: 256423926
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3534] Alpha-L-arabinofuranosidase
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
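
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names gene_length and protein_length are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 6182401
END_BP = 6184383


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1983 660
```

Under these assumptions the computed values agree with the Gene Length (1983 bp) and Protein Length (660 aa) fields listed in the record.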


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.142964
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.295957
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACGAA TAATCACATT TGTCGCCACA GCATTTTTGC TGCTGGCAGG AGCAGCTTCT 
TTTTCTCAGA CAATGACTTT AAAAGTAAAT GGTCCGCAGT CGGAGGTATC TCCAACGATG
TGGGGAATCT TTTTTGAGGA CATTAACTTC TCCGCAGATG GCGGTATCTA CGCCGAATTA
ATAAAGAACC GCTCTTTTGA ATTTACGGAA CCGATGATGG GATGGAAAGA GGTGAAGAAA
GAGGGAGCAG GTAATATATT AATAGTGAAC AGGGAGACCG GGCATACGGC TAATCCCAGG
TATGCACATA TCACCGTAAC TGCGGACAAA GGTAGTTACG GACTGTTCAA TGAGGGTTTC
CGTGGTATGG GCTTTAAAAA AGGGTTATCA TATAACTTTT CTTTTCTGGC ACGCAGCACG
GCAGGTAAAG TAAGCGGAAA ACTGGTATTA GTGGATGATA AGGGTACCCC TATAGGAGCG
GTCGCAGTCT CAGCAGAAGA TAAAGCCTGG AAAAAGTATA CGGCTACAGT GACCGCTGAG
CGTACGGTGG CCAAAGGAGG CGTACAGCTG TTATTTTCCG GTACGGGCGC CCTTGATATG
GATATGGTAT CGTTATTCCC CCAGGATACC TGGAAACAGC GTCCGGGAGG CTTGCGTAAT
GACCTGGTAC AGTTACTGGC CGACCTGCAT CCCGGATTTG TGAGATTTCC GGGCGGTTGT
ATTGTTGAAG GACGGGATCT GGCCAACAGG TATCAGTGGA AAAAAACGGT AGGTAAACCA
GAGGATCGTA CGTTAATCGT AAATCGTTGG AATACGGAAT TTGCGCATCG GGCACCCGGT
GATTATTTCC AGAGCTATGG ACTGGGCTTT TTTGAATATT TCCAGTTATC GGAAGATATA
GGTGCTGAAC CCTTACCGAT TCTCAATTGT GGGATGGCAT GTCAGTTTAA TACAGGAGAG
GTGGCAGCTG ATGAGGATGT GGAAGCGTAT ATACAGGATG CGCTGGACCT GATTGAATTT
GCCAACGGTG CCACGACAAC CGAGTGGGGG CAACTGAGAG CGGAAATGGG GCATCCTGCG
CCGTTTAACC TGAAATTGAT GGGAGTAGGT AATGAGCAGT GGGATAGTCA GTATATTGCC
CGTTATCAGC GTTTTGAGGA GGTGCTGAAG ACTAAACATC CTGAGATAAA ACTGGTATCC
AGTGTAGGGC CATTTTCCAG CGGCGAGCGA TTCGATTATC TGTGGTCTAA ACTGAAGCCT
TCCAAGGCGG ATCTTGTAGA TGAACATTAT TATATGCCAC CGGAGTGGTT TCTGAAAAAT
AGTGCCCGTT ATGATAAATA TGAGCGGAAA GGGCCAAAGA TCTTTGCAGG AGAATATGCC
GCGCATGTTA AAGTACCGCA GGGTAAAAAA GAGACAGATA CTGCAGAAGG AAGGAATACC
TGGGAGAGTG CATTGGCAGA AGCAGCGTTT ATGACGGGAT TGGAAAGAAA TGCGGATCTT
GTACAGATGG CTTCCTATGC GCCATTACTT GCGCATGTGG ATGCATGGCA ATGGAGACCC
GATCTGATCT GGTTTGATAA TCTGCGTTCA GTGGGCACAC CGAATTATTA TGTGCAACGT
CTTTTTGCTA ATAATAAGGG TACGCATACC GTATCTGTTA CATCCGGTAA TCGGGCATTA
ACAGGGCAGG AGGGTATTTA TGCCAGTGCA ACGATTGATA AGCAGGCCCG CAAAATTCTG
CTGAAGGTAG TGAATACCAC AGATAAGGCG GCAGCGTATA CTATTGCGTT GGAAGGCGCT
GTAGCAGCAA CAGGTACAGC TACGCAGGAA GTGCTGACGG CGAAGGCTAA AAGTGATATC
AATACGCTGG ATGCACCATC GGTGGTAATA CCTGTTACTG CACAGATCAA AGCCGGAAAG
AATAAAGTGG CAATTACTGC AGGCGCTAAT TCACTGAATG TAATCACGAT ACCTTATAAA
TAA
 
Protein sequence
MKRIITFVAT AFLLLAGAAS FSQTMTLKVN GPQSEVSPTM WGIFFEDINF SADGGIYAEL 
IKNRSFEFTE PMMGWKEVKK EGAGNILIVN RETGHTANPR YAHITVTADK GSYGLFNEGF
RGMGFKKGLS YNFSFLARST AGKVSGKLVL VDDKGTPIGA VAVSAEDKAW KKYTATVTAE
RTVAKGGVQL LFSGTGALDM DMVSLFPQDT WKQRPGGLRN DLVQLLADLH PGFVRFPGGC
IVEGRDLANR YQWKKTVGKP EDRTLIVNRW NTEFAHRAPG DYFQSYGLGF FEYFQLSEDI
GAEPLPILNC GMACQFNTGE VAADEDVEAY IQDALDLIEF ANGATTTEWG QLRAEMGHPA
PFNLKLMGVG NEQWDSQYIA RYQRFEEVLK TKHPEIKLVS SVGPFSSGER FDYLWSKLKP
SKADLVDEHY YMPPEWFLKN SARYDKYERK GPKIFAGEYA AHVKVPQGKK ETDTAEGRNT
WESALAEAAF MTGLERNADL VQMASYAPLL AHVDAWQWRP DLIWFDNLRS VGTPNYYVQR
LFANNKGTHT VSVTSGNRAL TGQEGIYASA TIDKQARKIL LKVVNTTDKA AAYTIALEGA
VAATGTATQE VLTAKAKSDI NTLDAPSVVI PVTAQIKAGK NKVAITAGAN SLNVITIPYK