Gene Cpha266_2165 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_2165 
Symbol 
ID4570766 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp2506762 
End bp2508024 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content50% 
IMG OID639766740 
Productpeptidase U32 
Protein accessionYP_912594 
Protein GI119357950 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCTTT CCGAGTCTGT GACAAGCGAA AAAAAAATTG AACTTATAGC GCCTGCCGGC 
GACTGGACCT CACTCCGCAC CGCACTGCAA GCAGGAGCCG ATGCCGTCTA TTTCGGAGCT
GAAGGCTATA ACATGAGGGC CGGAAGCAAT AACTTCACTC CGGCTGATTT TCCCGCCATC
ATGACGCTCT GCAGTGAGTT CAACGCCAAA GCGTATCTGG CGCTGAACAC GATCGTCTAT
GACGGCGAAC TGAAAAAGAT GGTTCAAACC GTCTCCGCTG CCAAAACGAC AGGCTTCGAT
GCCGTTATCT GCTCGGACAT GGCTGTCGTC GATGCATGCC GAAAAGCAGC AATGCCCTTT
CATATGTCAA CACAGGCTTC GATCAGCAAC TACAGCGCAG TAAAATTCTA TGCCGACCTT
GGCGCAAAAA TGATCGTGCT GGCCCGCGAG CTTACCATTG ACCAGGTACG CCATATTACC
TCGAAATTAA AGGCCGACCG TCTCGATGTA CAGATCGAGT GCTTTGTTCA CGGAGCGATG
TGCGTCGCTG TTTCCGGGCG CTGCTTCATG TCACAGGAAC TTTTCGGACG CTCCGCCAAC
CGGGGACAGT GCGTTCAGCC CTGCCGAAGG CAATATATCA TCACCGATCC TGAAGAGAAC
CAGGAGCTTG AGCTTGGTAC CGATTATGTT ATGAGTCCGA AAGACATGTG CGCAGTGGAA
TTTCTTGACG TTCTCATGGA TGCGGGAATC AGCGCATTCA AAATCGAAGG ACGAAGCCGC
AGTCCGGAAT ATGTTCATAC TGCGACAACA GCTTACCGAC GGGCGATCGA CTTCTGCACG
AGCCACCGCA ACAGTCCGGA ATTCAGAACA GAGTACAACT CCTTATCGAA ACAGCTTAAA
GAGGAACTCG CACGGGTATA TAACCGGGGA TTTTCGGAAG GATTTTATTT TGGAAAACCC
TTCGATGCCT GGACCAGAGA GTACGGCTCA ATGGCCTCCG AAAAAAAAAT CTATATCGGA
GAGGTTAAAA AATATTATCC AAAAGCGGAG GTGGCTGAAA TCCTCATCTT TGCCCGAGGC
CTCAAACAAG GCGATAAGCT CTCTGTTCTC GGCCCGAAGA CAGGAGTTAC AACCCTTTTT
GCCGAAAGCT TTTATACCAA CGATCTTCCT GCAAAAACGG CTGTCAGGGG CGACAGCGTC
ACCATCAAAT GTGCAAAAGT GAGAAAGAAC GACAAGGTAT ATGTGCTTGA AAAAAGAAGC
TGA
 
Protein sequence
MNLSESVTSE KKIELIAPAG DWTSLRTALQ AGADAVYFGA EGYNMRAGSN NFTPADFPAI 
MTLCSEFNAK AYLALNTIVY DGELKKMVQT VSAAKTTGFD AVICSDMAVV DACRKAAMPF
HMSTQASISN YSAVKFYADL GAKMIVLARE LTIDQVRHIT SKLKADRLDV QIECFVHGAM
CVAVSGRCFM SQELFGRSAN RGQCVQPCRR QYIITDPEEN QELELGTDYV MSPKDMCAVE
FLDVLMDAGI SAFKIEGRSR SPEYVHTATT AYRRAIDFCT SHRNSPEFRT EYNSLSKQLK
EELARVYNRG FSEGFYFGKP FDAWTREYGS MASEKKIYIG EVKKYYPKAE VAEILIFARG
LKQGDKLSVL GPKTGVTTLF AESFYTNDLP AKTAVRGDSV TIKCAKVRKN DKVYVLEKRS