Gene Cag_1120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1120 
Symbol 
ID3748338 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1510560 
End bp1512143 
Gene Length1584 bp 
Protein Length527 aa 
Translation table11 
GC content41% 
IMG OID637773651 
Producttype I restriction-modification system specificity subunit 
Protein accessionYP_379425 
Protein GI78189087 
COG category[V] Defense mechanisms 
COG ID[COG0286] Type I restriction-modification system methyltransferase subunit 
TIGRFAM ID[TIGR00497] type I restriction system adenine methylase (hsdM) 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAAAC AAAAAGCAAC CCTTCGGCAA ACTCAGGGAA AACAGGAAGA ACCTTTAGAA 
AAACAGCTTT GGAAAACCGC CGACAAACTC CGCAAGAACA TTGATGCGGC AGAATACAAA
CACATTGTGT TAGGGTTAAT CTTCTTGAAA TATATTTCTG ATTCATTTGA AGAACTCTAT
GCAAAGCTAC AAGCAGAGGA AGCAAACGGA GCAGACCCCG AAGACAAAGA TGAATACAAA
GCTGAAAATG TATTCTTTGT TCCGCAGGAT GCACGATGGA ATTATTTGCA ATCGAAAGCA
AAACAACCTG AAATTGGAAA GTTTGTGGAC GATGCAATGG ATGTTATAGA AAAAGAAAAT
GCTTCACTAA AAGGAGTTTT ACCAAAAGTA TTTGCCCGAC AAAATCTTGA TCCAACAAGT
TTGGGCGAAC TGATTGACTT GGTTGGAAAC ATTGCCTTAG GTGATGCAAA AGCAAGAAGT
GCCGATGTGC TTGGGCATGT TTTTGAATAT TTCTTAGGTG AGTTTGCTCT TGCAGAAGGC
AAAAAAGGTG GGCAGTTTTA TACGCCAAGA AGCGTTGTAG AATTATTGGT TGAAATGTTG
GAGCCATACA AAGGAAGAGT TTTTGACCCT TGCTGTGGTT CGGGTGGAAT GTTTGTTCAC
TCCGAAACGT TTGTAACAGA GCACCAAGGG AAAGTAAACG ACATCTCTAT TTACGGGCAG
GAAAGCAACC AAACAACGTG GCGCTTATGC AAAATGAACC TTGCGATTCG AGGTATTGAT
AGCTCACAAG TGAAATGGAA CAACGAAGGC TCTTTTTTAA ACGATGCACA TAAAGACCTG
AAAGCCGATT ACATTATTGC TAATCCACCA TTCAACGTGA GTGATTGGGG TGGTGATTTA
ATGCGAAGCG ATGGACGTTG GCAATATGGT ACGCCACCAA CAGGCAATGC CAACTTTGCA
TGGATGCAAC ATTTTATTTA CCACTTAGCA CCCAATGGAC AAGCAGGTGT TGTATTAGCA
AAAGGTGCTT TAACATCTAA AACTTCAGGT GAAGGCGATA TACGAAAAGC ATTAGTTGAA
AACGGTTTGA TTGATTGTAT TGTAAACCTG CCTGCCAAGT TGTTTTTAAA TACACAGATT
CCTGCTGCCT TATGGTTTCT TCGTAGAGAT GCAAAATTTT TCGTCTCTAC AAATGGAAAA
TTTCGCGACC GAAGCAATGA AATATTATTT ATTGATACCC GAAACTTAGG GCATTTAATA
AATCGCAGAA CCCGTGAACT ATCAAAGGAA GACATATATA AAATCGCCAG CACTTACCAC
GCATGGAGAA CGCTGCCTGA GGCTCTCAAT GGCAGCGCCT ATGCAGATAT CCTTGGCTTT
TGTGCATCCG TTGCCATAAG CAAAGTAGCC GAATTGGATT ATGTGCTTAC GCCAGGACGT
TATGTAGGCT TACCCGATGA TGAAGATGAT TTTGATTTTG CGGAACGTTT TACAGCGTTA
AAAGCCGAGT TGGAAATGCA ATTGCAAGAA GAAGCTCAAC TGAATGCAGT GATTTCCGCT
AACCTTTTAA AGATTAAGTA TTGA
 
Protein sequence
MAKQKATLRQ TQGKQEEPLE KQLWKTADKL RKNIDAAEYK HIVLGLIFLK YISDSFEELY 
AKLQAEEANG ADPEDKDEYK AENVFFVPQD ARWNYLQSKA KQPEIGKFVD DAMDVIEKEN
ASLKGVLPKV FARQNLDPTS LGELIDLVGN IALGDAKARS ADVLGHVFEY FLGEFALAEG
KKGGQFYTPR SVVELLVEML EPYKGRVFDP CCGSGGMFVH SETFVTEHQG KVNDISIYGQ
ESNQTTWRLC KMNLAIRGID SSQVKWNNEG SFLNDAHKDL KADYIIANPP FNVSDWGGDL
MRSDGRWQYG TPPTGNANFA WMQHFIYHLA PNGQAGVVLA KGALTSKTSG EGDIRKALVE
NGLIDCIVNL PAKLFLNTQI PAALWFLRRD AKFFVSTNGK FRDRSNEILF IDTRNLGHLI
NRRTRELSKE DIYKIASTYH AWRTLPEALN GSAYADILGF CASVAISKVA ELDYVLTPGR
YVGLPDDEDD FDFAERFTAL KAELEMQLQE EAQLNAVISA NLLKIKY