Gene Cpin_5940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_5940 
Symbol 
ID8362121 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp7543376 
End bp7544476 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content48% 
IMG OID644968079 
Productcarboxylate-amine ligase 
Protein accessionYP_003125559 
Protein GI256424906 
COG category[S] Function unknown 
COG ID[COG2170] Uncharacterized conserved protein 
TIGRFAM ID[TIGR02050] uncharacterized enzyme 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.111906 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCAATA ACTTCACCCT CGGTATTGAA GAGGAATACA TGGTCCTGGA CCCCCAGACC 
AGAGAACTGA GATCTCATGA ACAAAAGATC GTAGAACAAG CCCACAGGGT GCTCAGAGAT
AAAGTAAAAG CTGAATTTCA CCAGGCAGTT GTGGAAGTAG GCACAGAGGT ATGCGCCAAT
ATAGATGAAG CCTGCGAAGA TGTATCAATG CTGCGCAGGA CAATTGCCAG CATAGCAGGC
GACCTGGGAT ACAGCATCGG CGCTTCCGGT ACACATCCTT TCTCCAAATG GCAACTCCAG
CATATTACAG ACAATCCCCG TTACTTTGAG ATCGTCAATG AAATGCAGGA TGCCGCCCGC
TCCAACCTCA TCTTCGGATT ACATGTACAT GTAGGCATGG AAAACAGGGA AATGGCCCTC
CATATCGCCA ACTCCGTACG TTATTTCCTG CCCCATGTGT TTGCACTCAG TACCAACTCT
CCTTTCTGGG AAGGTCGCAA CACCGGCTTC AAATCTTATC GGACCAAGGT ATTTGATAAA
TTCCCCCGTA CCGGTATTCC CGATTACTTC GCCAGTATCG AAGAATATGA CCGGTATATA
CAGTTGCTCG TAAAAACCAA CTGTATCGAC AACGCCAAGA AAGTATGGTG GGACCTGCGG
GTACACCCCT TCTTCAATAC AGTGGAATTC CGCATCTGCG ACGTACCGCT CACCGTAGTG
GAAACCTGTA CCCTGGCAGC ACTGTTCCAG GCTGTTTGCG CAAAGATTTA TAAACTGCGT
ATGCAGAACC TCAACTTCAT CATCTATAAC CGTGCACTGG TTAATGAGAA CAAATGGCGG
GCATCCCGCT ACGGTATTGA CGGTAACCTG ATCGACTTCG GTAAAGAAAT GGAAGTCAAT
GCCAGAGCGC TGATTCATGA ACTGCTCGAC TTCGTGGACG ATGTAGTAGA TGAACTGGGC
AGCCGTCACT ATATACAGCG CACCCTCGAC ATTCTCGAAA ACGGTACCGG CGCCGATCAG
CAGTTAAAAG TATACACAGA TACCAAAGAT CTGGTATCTG TTACCGACTT CATCACTAAT
AGCTTTCTGA AAGACTGCTA A
 
Protein sequence
MLNNFTLGIE EEYMVLDPQT RELRSHEQKI VEQAHRVLRD KVKAEFHQAV VEVGTEVCAN 
IDEACEDVSM LRRTIASIAG DLGYSIGASG THPFSKWQLQ HITDNPRYFE IVNEMQDAAR
SNLIFGLHVH VGMENREMAL HIANSVRYFL PHVFALSTNS PFWEGRNTGF KSYRTKVFDK
FPRTGIPDYF ASIEEYDRYI QLLVKTNCID NAKKVWWDLR VHPFFNTVEF RICDVPLTVV
ETCTLAALFQ AVCAKIYKLR MQNLNFIIYN RALVNENKWR ASRYGIDGNL IDFGKEMEVN
ARALIHELLD FVDDVVDELG SRHYIQRTLD ILENGTGADQ QLKVYTDTKD LVSVTDFITN
SFLKDC