Gene Cpha266_0056 details


Gene Information

Locus tag: Cpha266_0056
Symbol:
ID: 4571248
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 60768
End bp: 62186
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 52%
IMG OID: 639764658
Product: peptidase M12A, astacin
Protein accession: YP_910550
Protein GI: 119355906
COG category:
COG ID:
TIGRFAM ID:
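
The gene and protein lengths above can be cross-checked against the start and end coordinates. The following is a minimal Python sketch, not part of the record itself: the variable names are illustrative, and it assumes the coordinates are inclusive and that the CDS ends in a single untranslated stop codon.

```python
# Values copied from the record above; variable names are illustrative.
start_bp = 60768
end_bp = 62186

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # 1419 472
```

Both values agree with the Gene Length and Protein Length fields listed above.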


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGCGTA AAAAAAACGG AAACACTCCA CAGGAGTATG GCGAATTCTC TCCATCAGGT 
AAGCTGCGTA CCGCTATTAT CGAGGGGAAT ACGTTTGGGT ACAAGTCGGT TCAGTATACT
GACGTTGATG GAATGGCCAT GTTTGAAGGG GACATCATTC TCGGCAAGGT TGCCGATGTC
GACAGCAAAA CCGAGCAGCG CAAACGGGAG ATACAGCAGG GGGTTACGCT GCGGGGCATT
ACCATTACAG GAGCCAAGTA CCGATGGCCG AACTGTAAAG TCCCCTACAC CATCGATACG
GCTCTGCCTA ATCAGAGCAG AGTAACCGAT GCCATAGCCC ACTGGGAGGC AAAAACAAAG
TTCCGGTTTA TTCTTCGAAC AAATGCCAAT GCGTCAAGTT ATCCTGACTG GGTAACATTC
AGATCGGGTT CCGGCTGCAG CTCCTATGTC GGGAAACAGG GCGGCCAGCA ATACATTAAT
CTTGCCTCGG GATGCTCGAA AGGCAATACC ATTCATGAAA TCGGCCATAC CATAGGGCTT
TGGCACGAAC ACAGCCGTGA AGACCGGAAC GCATTTGTCA CCATTCACTG GGATAAAATC
ATCGCAGGGT ATGAACACAA TTTCAATCAG CAGATAAGCG ATGGTGATGA TGTCGGCGCT
TATGACTACG GATCCATTAT GCACTATCCG AGAACCGCCT TTTCAACTGA CGGCTCGGAA
ACCATCACCC CGACCGATCC GTCCGCATCG ATAGGCCAGA GAACTGCTCT CAGCGCCGGT
GACATTGCGG CAGCAAACTC TCTCTGCCCG ACCGTTTCGC TCTGTCCCGC AGCGCCGAAA
ACCTGTCCCG GTGCACCGAT ACAGGTTTGC CCTGTTTCAC CGAAACTCGT ATGTCCTCCG
GGAATAAAGC TCGCCTGTCC TCCGGGAATA AAACAACTTT GTCCTCCGGG AATAAAACAA
AGTTGTCCCT CAGCACCAAT TCAGGTTATC TGTCCGCCAG GGATAAAACT CGCTTGTCCT
CCGGGAATAA AAGTCACTTG TCCTCCTGTG CCGAAAATAC CGATCTGCCC ACCGTCACCG
GTTCCGGGAT GTGCTGCCGG CCCGACAAAC AAACCGTGGG TCGGACCGGA GGGGTACACA
ACAACCTATC GGCTTGATCC TGCGTCCGGA GCTTACTACA GCGATGAGGC CCCTCCTCCA
GGCATGAATC AGATGCCTCC GGTTGTCATC AACATTAATT TTCACGGTTA TCAACCTCCT
TCCATTCAAT CGGATTATGC ACAATACGAC CCCTCCGCCT ATGAAAATCA GGACTGGACA
GCAACGGAGT ATCCTGATCC CGGAGAGGAA GCTGATGATT CGATAACAAA CGAAGAGAGC
GAAGCACCGG AAGATTTCAA TCCTGAATGT TCGGAGTAA
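
The GC content field could in principle be recomputed from the gene sequence above. Here is a hedged Python sketch: `gc_percent` is a hypothetical helper (not a database or library function), and for brevity it is run only on the first two 10-base blocks of the sequence, so the printed value is not the whole-gene figure.

```python
def gc_percent(blocked_seq: str) -> float:
    """Return the percentage of G and C bases in a whitespace-blocked sequence."""
    # Remove the spaces and line breaks used to group bases in blocks of ten.
    seq = "".join(blocked_seq.split()).upper()
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)

# First two blocks of the gene sequence, copied from the record.
sample = "ATGGCGCGTA AAAAAAACGG"
print(gc_percent(sample))  # 45.0
```

Passing the full gene sequence to the same function gives the whole-gene percentage, which can then be compared with the GC content field.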
 
Protein sequence
MARKKNGNTP QEYGEFSPSG KLRTAIIEGN TFGYKSVQYT DVDGMAMFEG DIILGKVADV 
DSKTEQRKRE IQQGVTLRGI TITGAKYRWP NCKVPYTIDT ALPNQSRVTD AIAHWEAKTK
FRFILRTNAN ASSYPDWVTF RSGSGCSSYV GKQGGQQYIN LASGCSKGNT IHEIGHTIGL
WHEHSREDRN AFVTIHWDKI IAGYEHNFNQ QISDGDDVGA YDYGSIMHYP RTAFSTDGSE
TITPTDPSAS IGQRTALSAG DIAAANSLCP TVSLCPAAPK TCPGAPIQVC PVSPKLVCPP
GIKLACPPGI KQLCPPGIKQ SCPSAPIQVI CPPGIKLACP PGIKVTCPPV PKIPICPPSP
VPGCAAGPTN KPWVGPEGYT TTYRLDPASG AYYSDEAPPP GMNQMPPVVI NINFHGYQPP
SIQSDYAQYD PSAYENQDWT ATEYPDPGEE ADDSITNEES EAPEDFNPEC SE
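
The protein sequence corresponds to a translation of the gene sequence under translation table 11, which uses the same codon-to-amino-acid assignments as the standard genetic code. The sketch below illustrates this on the first 30 bases only. The `translate` function and the constant names are hypothetical, written for this example rather than taken from any library.

```python
# Standard codon assignments (shared by translation table 11), in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(blocked_dna: str) -> str:
    """Translate a whitespace-blocked DNA string, stopping at the first stop codon."""
    seq = "".join(blocked_dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the coding sequence
            break
        protein.append(amino_acid)
    return "".join(protein)

# First three blocks of the gene sequence, copied from the record.
first_blocks = "ATGGCGCGTA AAAAAAACGG AAACACTCCA"
print(translate(first_blocks))  # MARKKNGNTP
```

The output matches the first block of the protein sequence above.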