Gene Cpha266_1901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1901 
Symbol 
ID4570860 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp2207738 
End bp2208796 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content53% 
IMG OID639766483 
Productpeptidyl-arginine deiminase 
Protein accessionYP_912341 
Protein GI119357697 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2957] Peptidylarginine deiminase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.459054 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGGATT TCCAGTATCG CATGCCCCCG GAGTGGGCTT GTCACAAAGC CACCTGGCTT 
TCCTGGCCGC ACAAACGTGA ATCCTGGCCA GGAAAATTTG AACCGGTTCC GGCAGTTTTT
GTCGAGATTG CTTCATGGCT GAGTTCATCG GAAGAGGTAC ATATCAATGT GCTTGACGAA
GCGATGGAAG TGGAGGTGCG CGAACTGTTC CGAAAGGCAG AGTACGATCA TTTGCGAAGG
GATCGACTTG TGCTGCACCG CATTCCAACC AATGACGCCT GGTGTCGCGA CCACGGCCCG
AACTATGTAT TCCGCCAGGG CGGCACCGGC CTGGAGAAGG TGATTCTGAA CTGGCAGTTC
AATGCCTGGG GAGGAAAATA TGAGCCCTGT GATGATGACA ATGCAGTGCC TGTGCGGATT
GCGGAACAGC AGCATTTGCC TCTGGTTTCG ATCGACATGG TGCTTGAAGG TGGAGCAATC
GATGTGAACG GGAACGGTTT GCTGCTGACG ACCGAAGCCT GTTTATTGAA CAGGAACCGC
AATCCCGGAA TGAGTCGTCT TGAGATCGAA GGCGCGCTTG GCAGGTATCT CGGTATTGAA
AAGGTACTCT GGCTCGGCGA CGGTATAGCA GGGGACGATA CCGATGGTCA TGTTGATGAT
ATGGCGAGGT TTGTCAATGA ATCAACCGTC GTGATAGCCG TTGAAGACGA TGCTGCTGAT
GAGAATTATG AACCGCTTCA GGATAACTAT CGACTGCTCA AGAGTTTTAC CGATCTCAGG
GGAGAGCCTC TCAACGTGGT GAAGCTGCCC ATGCCGGATC CGGTTTACTA CGATGGCGAA
CGTCTTCCTG CAAGCTATGC GAACTTCTAT ATTGCCAACA GTGTTGTGCT TGTACCTCTA
TACCGGTGCG ACGCCGACAA AAAGGCGCTT GCTGTTCTGC AGGAGTGTTT TCCCGGAAGA
AAGGTTGTGG GTATCGACTG TTCAGACCTT ATATGGGGAT TGGGAGCCAT TCACTGTATC
ACCCACGAAG AGCCGGATCT GCCGATACGG GAGAGGTGA
 
Protein sequence
MMDFQYRMPP EWACHKATWL SWPHKRESWP GKFEPVPAVF VEIASWLSSS EEVHINVLDE 
AMEVEVRELF RKAEYDHLRR DRLVLHRIPT NDAWCRDHGP NYVFRQGGTG LEKVILNWQF
NAWGGKYEPC DDDNAVPVRI AEQQHLPLVS IDMVLEGGAI DVNGNGLLLT TEACLLNRNR
NPGMSRLEIE GALGRYLGIE KVLWLGDGIA GDDTDGHVDD MARFVNESTV VIAVEDDAAD
ENYEPLQDNY RLLKSFTDLR GEPLNVVKLP MPDPVYYDGE RLPASYANFY IANSVVLVPL
YRCDADKKAL AVLQECFPGR KVVGIDCSDL IWGLGAIHCI THEEPDLPIR ER