Gene Cag_1526 details


Gene Information

Locus tag: Cag_1526
Symbol:
ID: 3746585
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 2004998
End bp: 2006158
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 39%
IMG OID: 637774066
Product: restriction endonuclease S subunits-like
Protein accession: YP_379824
Protein GI: 78189486
COG category: [V] Defense mechanisms
COG ID: [COG0732] Restriction endonuclease S subunits
TIGRFAM ID:
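
The length fields above agree with the coordinates if start and end are both treated as inclusive positions. That convention is an assumption here, since the record does not state it. A minimal Python sketch of the arithmetic, using hypothetical helper names that are not part of any database API:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end positions are both inclusive."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS that ends in a single stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return length_bp // 3 - 1


length_bp = gene_length_bp(2004998, 2006158)
length_aa = protein_length_aa(length_bp)
print(length_bp, length_aa)  # 1161 386
```

Both values match the Gene Length and Protein Length fields of the record.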


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAG AAGCACTTGG TAAACTCGTT GACATCAAGA CAGGAAAATT AGATGTAAAT 
GCAGGAACAG AATACGGTAA ATATCCCTTT TTTACTTGTG CCAAAACAGT TTACAGAATT
AATCAATACG CATTTGATAA TGAAGCTATA CTTGTTGCTG GAAATGGCGA CTTGAACGTT
AAGTACTTTA AAGGAAAATT CAATGCCTAT CAAAGAACCT ATGTAATTGA GAATAAAGAA
GTAAATTTAT TATCCATGAA ATACTTGTAC TATTTTATGG AAACATATAT GATTCATCTA
AGAAATGGAG CTATTGGAGG AATCATTAAA TACATTAAAA TTGATCACTT AACTAAAGCA
GAAATCCCTC TCCCCCCACT TGACGACCAA AAACGCATTG CCCACCTACT CGGCAAAGTA
GAGCGGCTAA TTGCCCAACG CAAACAACAT CTGCAACAGC TTGACCAACT GCTCAAAAGC
GTTTTTCTGG AGATGTTCGG CTTCTTTGAT AAAACATATA CCAACTGGAC TATCGATACA
TTAACATCGC ACACAGAGAT CGTATCGGGT ATTACAAAAG GAAAAAAATA CAAAACCGAT
GAATTAATTG AAGTTCCGTA TATGCGTGTT GCAAATGTAC AAGACGAACA CTTCGTATTA
GACGAAATCA AAACGATCTC TGTAACCAAA AACGAGATCA AGCAGTATCG GCTTCTTGCT
GGCGATCTAT TATTAACAGA AGGTGGCGAT CCCGATAAGC TTGGGCGAGG CGCTGTTTGG
CAAAACCAGA TTGAAAACTG TATTCATCAG AACCACATTT TTCGTGTTCG AGTAAACGAT
AAATCCAGAA TTAACCCTGA CTATCTTAGC GCATTAATAG GATCTCCATA CGGAAAATCT
TACTTCTTTC GTTCTGCAAA GCAGACAACT GGGATTGCCT CTATAAACTC AACTCAGTTG
AAAAAATTTC CTATTGTAAT TCCCCCCATC GAACTCCAAA ACCGCTTCGC CACCATCGTT
GAAAAAGTTG AAAGCATCAA AACGCACTAC CAACAAAGCC TCAACAACCT CGAAACACTT
TACAACGCAC TAAGCCAAAA AGCCTTCAAA GGCGAGCTGG ATTTATCGCG CGTGGCGGTG
CTGGTGGACG TTACACCTTA A
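
The GC content field of a record like this can be recomputed from a sequence laid out in space-separated blocks of ten bases. The sketch below shows one way to do it. The function name `gc_percent` is hypothetical, and the rounding the database applied to reach its reported figure is not known, so the result may differ slightly from the stored value.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence, kept in the record's block layout.
first_line = "ATGAAAAAAG AAGCACTTGG TAAACTCGTT GACATCAAGA CAGGAAAATT AGATGTAAAT"
print(f"{gc_percent(first_line):.1f}% GC over the first 60 bases")
```

Passing the full gene sequence, line breaks included, gives the value for the whole CDS.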
 
Protein sequence
MKKEALGKLV DIKTGKLDVN AGTEYGKYPF FTCAKTVYRI NQYAFDNEAI LVAGNGDLNV 
KYFKGKFNAY QRTYVIENKE VNLLSMKYLY YFMETYMIHL RNGAIGGIIK YIKIDHLTKA
EIPLPPLDDQ KRIAHLLGKV ERLIAQRKQH LQQLDQLLKS VFLEMFGFFD KTYTNWTIDT
LTSHTEIVSG ITKGKKYKTD ELIEVPYMRV ANVQDEHFVL DEIKTISVTK NEIKQYRLLA
GDLLLTEGGD PDKLGRGAVW QNQIENCIHQ NHIFRVRVND KSRINPDYLS ALIGSPYGKS
YFFRSAKQTT GIASINSTQL KKFPIVIPPI ELQNRFATIV EKVESIKTHY QQSLNNLETL
YNALSQKAFK GELDLSRVAV LVDVTP
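
The protein sequence can be checked against the gene sequence by translating it codon by codon. Translation table 11 (bacterial, archaeal and plant plastid) assigns the same amino acids as the standard code and differs only in which codons may act as start codons, so a standard codon table is enough for a CDS that begins with ATG. The following is a minimal sketch, not the database's own method, and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated with bases in T, C, A, G order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and last 12 bases of the gene sequence.
print(translate("ATGAAAAAAG AAGCACTTGG TAAACTCGTT"))  # MKKEALGKLV
print(translate("GTTACACCTTAA"))  # VTP
```

The two printed fragments match the first ten and last three residues of the protein sequence above.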