Gene CPS_2237 details


Gene Information

Locus tag: CPS_2237
Symbol:
ID: 3521332
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Colwellia psychrerythraea 34H
Kingdom: Bacteria
Replicon accession: NC_003910
Strand:
Start bp: 2337258
End bp: 2339261
Gene Length: 2004 bp
Protein Length: 667 aa
Translation table: 11
GC content: 36%
IMG OID: 637284694
Product: collagenase
Protein accession: YP_268962
Protein GI: 71280863
COG category:
COG ID:
TIGRFAM ID:
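
The coordinate and length fields above agree with each other, which a little arithmetic confirms. The following is a minimal Python sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon excluded from the protein length. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # subtract the terminal stop codon


# Values copied from the Gene Information fields.
length = gene_length_bp(2337258, 2339261)
print(length)                     # 2004
print(protein_length_aa(length))  # 667
```

Both results match the Gene Length and Protein Length fields listed in the record.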


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.971967
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAAAC TAACAATACT TCTTCTGACT CTCGCTATAT CTGTTGTAAT TACTGCGTGT 
AGTGTTAACG ATGAGGCTGA TACTACTAAG GTCATGCAGT CTGTCTTAGT ACCAATAGTA
AATCCAATGG TTGCTCAAGT ATTAGCGACT GATATAGAGC AGTTATGGAG CCAAGATTTT
AATCATTATC AACAGGGATT ACTAAAAGCT GTTGCGGATG AAATATCAAT GGAAGCTTTA
AAAGGAGACT TAACTAATGA TAAATTAGAG AAGCTTACTT TTTATCTTAG AATTTACAGT
AGCTTTGGTG CAGATAAATA TTGGACCGAA GAAACCGCAA TATCTGTTAA TAGCGCATTA
GACAATTTAT ATAATATGCC TGGTTTTTTT GAGGTGAGTC AGACTACAGC ACGCTTGCAT
GAAAATTACG CTGTTGCTTT GTATCGTTTA TATTTTTTAG CACCTTTACA ACCATTTATA
GTAGAGCAGG TAAAGCCGTT AAGTCAGTTA ATTAACCTGT ATGCTTCCGC TGATCTTTCT
AATACAACTA CAGTGAACTC TGATAAAGAC ACAGCGATAG ATTATGCACT GTGGGAAGTA
TTACGTGCTG GCGCTATTTT ACCCTACGAA GCCCGAAGAA AAAATACTGC TGAATTCATG
AAAGGTGTAC ATGGTGAAGG TGAACTTCAA CAGGCATTAA TTCAATTTAT CACTGCTAAA
AATAGTACTT TAGTGGGGGA TGACTGGCCT AAGCAACACG CTCTTTGGGC ATTGGCACAG
TATTATAATT TATATACGAA AGAATATTGG AATGACTACT ATGAGCACTC AGCTGAGGAT
CAGAAACGTT TAGATGACGA TAAATTAACC CTCAAGATTG AAGGTGAAAT GGATACGCTT
GATAACAGTG TATGGGCGGC GTTAACCAAT GATAAAGCAA CATCAGTAGA GCAAAATAAG
ACACTTTTTA GTGTGCCTTA TGTCGTCAAC ACTTTTCGTG GAAAGTCTGA ATGTGAAGAG
GGTACGCTAG TCGATCGCTG CATTTCCCCA TCAATTGAAC AAGCATTGCC TATTAACCAT
GAGTGCTCAA GTAAAATATA TATTTTAGCC CAAGCCATGT CTTCAGCACA GTTAAGTGAT
GCTTGTCAGC AGTTAATCGC TCAAGAAAGT AATTTCCATG AAATATTAGC GACTAATAAT
CAGCCCGTTG CCAACGATTT TAACGATAAA TTACGAGTCG TTATTTTTGA TAATCACGCA
GAGTATAATA AATTCGGCCA GCTAATTTTT GATATTAATA CCGATAATGG TGGTATGTAC
ATTGAGGGAA CGACACAAGA TCCTAACAAT ATTGCGACTT TTTACTCATT TGAACATTTC
TGGGTACGAC CTGAATTTGC TGTTTGGAAT TTAAACCATG AGTTTGTTCA TTATTTAGAT
GGACGCTTTG TTAAATACGA TACTTTTAAT CATTTTCCAA GTCATATGGT GTGGTGGTCT
GAAGGACTCG CTGAATATGT TGCCAAGGAA GATAATAATC CAAAAACTTT CAAATTAGTC
AATGACACAA CTCCAGAAGA CTGGCCCAGT TTAACGGATA TTTTTAATAC TGAATATAAA
GACGGTACTG ATAGGGTATA TCGGTGGGGC TATTTAGCTG TGCGCTTTAT GAATGAAAAA
CATCAAAATG AATACAGGAA AATGGCGCAC TACTTAAAAA CAGACTTTTT TGATGGTTAT
AAAAAATTAG TTGAGGAGTC AGGTAAAAAG TATGCAGCAG AGTTTACTCA ATGGTTAGAT
GAACATAATG CTAACTATGT GGCGGAAGAA GATGTAAATA ACCCACATAA ACCACGTCAA
TTCTATCGTT ATACGTATAA AGATTACTTA CAGCCAAGTC ATTTAACGGA AGATAAGCTG
CATATGCACT GGCAGTATTG GCATGAAAAT GCTTTAAAAT CATTAGATAA AAAATTGGCT
AATAAAAATA CTGTGACTAA ATAG
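
The GC content field can be recomputed from a sequence laid out like the one above (blocks of 10 bases separated by spaces). The following is a minimal Python sketch under that assumption. The helper name `gc_percent` is hypothetical and is not taken from the source database.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the gene sequence, used as a small sample.
sample = "ATGAATAAAC TAACAATACT"
print(gc_percent(sample))  # 20.0
```

Passing the full 2004 bp gene sequence to the same function gives the gene-wide percentage, which can be compared with the rounded GC content value reported in the record.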
 
Protein sequence
MNKLTILLLT LAISVVITAC SVNDEADTTK VMQSVLVPIV NPMVAQVLAT DIEQLWSQDF 
NHYQQGLLKA VADEISMEAL KGDLTNDKLE KLTFYLRIYS SFGADKYWTE ETAISVNSAL
DNLYNMPGFF EVSQTTARLH ENYAVALYRL YFLAPLQPFI VEQVKPLSQL INLYASADLS
NTTTVNSDKD TAIDYALWEV LRAGAILPYE ARRKNTAEFM KGVHGEGELQ QALIQFITAK
NSTLVGDDWP KQHALWALAQ YYNLYTKEYW NDYYEHSAED QKRLDDDKLT LKIEGEMDTL
DNSVWAALTN DKATSVEQNK TLFSVPYVVN TFRGKSECEE GTLVDRCISP SIEQALPINH
ECSSKIYILA QAMSSAQLSD ACQQLIAQES NFHEILATNN QPVANDFNDK LRVVIFDNHA
EYNKFGQLIF DINTDNGGMY IEGTTQDPNN IATFYSFEHF WVRPEFAVWN LNHEFVHYLD
GRFVKYDTFN HFPSHMVWWS EGLAEYVAKE DNNPKTFKLV NDTTPEDWPS LTDIFNTEYK
DGTDRVYRWG YLAVRFMNEK HQNEYRKMAH YLKTDFFDGY KKLVEESGKK YAAEFTQWLD
EHNANYVAEE DVNNPHKPRQ FYRYTYKDYL QPSHLTEDKL HMHWQYWHEN ALKSLDKKLA
NKNTVTK
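
The protein sequence corresponds to the gene sequence translated codon by codon. The following is a minimal Python sketch of that translation, not the database's own pipeline. It assumes the reading frame starts at the first base and uses the standard codon assignments, which translation table 11 shares for elongation (table 11 differs mainly in the start codons it permits). The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments, "*" = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence (60 bases, 20 codons).
first_row = "ATGAATAAAC TAACAATACT TCTTCTGACT CTCGCTATAT CTGTTGTAAT TACTGCGTGT"
print(translate(first_row))  # MNKLTILLLTLAISVVITAC
```

The output matches the first 20 residues of the protein sequence above. The last row of the gene sequence ends in TAG, which the sketch treats as a stop codon.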