Gene Cag_1122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1122 
Symbol 
ID3747276 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1513370 
End bp1514656 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content34% 
IMG OID637773653 
Productrestriction endonuclease S subunits-like 
Protein accessionYP_379427 
Protein GI78189089 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.878738 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAACAA TGAGTGAGTG GAAGGAATAT AAACTAAAAG ACTTAGGTCT TCTACAAAGA 
GGACGTTCTC GACATCGTCC TAGATATGCC TTTCATTTGT ATGGCGGTAA ATATCCGTTT
ATTCAAACAG GAGAGATTCG AGAAGCATCT AAATACATCA CAAAATTCGA AAAAACATAT
AGTGAAGAAG GACTAAAACA ATCTAAGTTG TGGCCGAAAG GAACATTATG TATCACTATT
GCAGCCAACA TTGCAGAATT GGCTATATTG AATTTTGATG CTTGTTTTCC TGATAGTGTT
TTAGGATTTA TTCCTAATGA TAAAATAGCC AATGCAGATT TTATCTACTA TATATTAACA
CATTTCCAAA AAGAATTAAA ACATATTGGA GAAGGCTCAG TACAAGACAA TATAAATCTT
GGGACATTTG AAGATTTACT TTTTCCAATC CCTCCCCTTC CCGAACAACG TGCCATCGCC
TCCGTCCTGA GCAGCCTCGA TGACAAAATA GAGCTGCTCC ATCGCCAAAA TGCCACGCTT
GAAAAAATGG CTGAAACGTT ATTTAGGCAA TGGTTTATAG AAAGAAAAAG TCTGAACTAT
GATTCTTATG ATTTGCTTGA TGAACATGAT TTAAAAAATC AAAAGAATCA TAACAATCAA
AAAAATCATA GTTCAGACAA TGGAGAGGAG GCAATTGAGG AATGGAAGAT TGGGAAGGTT
TCTGATTATG CGTTACATTT GAAAGATTCT ATTCAGCCTC AAAAGAACCA ATCAACTTTT
TATTTTCATT ATAGCATACC ATCATTTGAC AATGATAAGA ACCCGATTAA AGAACTTGGA
AAAGAAATTC AAAGTAATAA ATACAAAGCA CCAAGATATT GCATTCTTTT CTCAAAATTG
AATCCTCATA AAGATAAAAG GGTTTGGCTT CTTCAAAACG AAGTAGAAAA AAATGCAATC
TGCTCAACTG AATTTCAAGT TGTATTACCA ATAAAAAGAC AGTATTTGTA TTTCTTATAC
GGTTGGCTAA CTCTAAATGA TAATTACAAC GAAATAGCTT CAGGAGTTGG TGGAACAAGT
GGAAGCCATC AAAGAATCGA CCCAAATACA ATTTATGATT TTCAATGCCC ACTTGTTACT
GAAAGCGTTA TTGAAAAATT TAATATTCAA ATAAAACCAC TTTTTAAGAA GCAAGTAATT
AACCAAACCC AAATCCGCAC CCTCACCGCA TTGCGCGATA TGTTGTTGCC AAAGTTGATG
AGTGGCGAGG TAAAAGTAGA TTATTAA
 
Protein sequence
MATMSEWKEY KLKDLGLLQR GRSRHRPRYA FHLYGGKYPF IQTGEIREAS KYITKFEKTY 
SEEGLKQSKL WPKGTLCITI AANIAELAIL NFDACFPDSV LGFIPNDKIA NADFIYYILT
HFQKELKHIG EGSVQDNINL GTFEDLLFPI PPLPEQRAIA SVLSSLDDKI ELLHRQNATL
EKMAETLFRQ WFIERKSLNY DSYDLLDEHD LKNQKNHNNQ KNHSSDNGEE AIEEWKIGKV
SDYALHLKDS IQPQKNQSTF YFHYSIPSFD NDKNPIKELG KEIQSNKYKA PRYCILFSKL
NPHKDKRVWL LQNEVEKNAI CSTEFQVVLP IKRQYLYFLY GWLTLNDNYN EIASGVGGTS
GSHQRIDPNT IYDFQCPLVT ESVIEKFNIQ IKPLFKKQVI NQTQIRTLTA LRDMLLPKLM
SGEVKVDY