Gene Cagg_1068 details


Gene Information

Locus tag: Cagg_1068
Symbol:
ID: 7268520
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1317732
End bp: 1319828
Gene Length: 2097 bp
Protein Length: 698 aa
Translation table: 11
GC content: 55%
IMG OID: 643565913
Product: hypothetical protein
Protein accession: YP_002462418
Protein GI: 219847985
COG category:
COG ID:
TIGRFAM ID:
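
The listed gene and protein lengths follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length (the variable names are illustrative only):

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
start_bp = 1317732
end_bp = 1319828

# Inclusive span: both the start and end positions belong to the gene.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the last codon is assumed to be a stop codon.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")     # 2097 bp
print(protein_length, "aa")  # 698 aa
```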


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAAAT CTAAAACCCA ACGCCGGTCT ATATCGAGCG GTACCTCATC TGCACAAGGA 
TACCGCTTTC TCAACCCATA CAACTTCGTG CGCACGCTGG AAACGCGCAA TGCGCACATT
GCTCGATTGC TGGGCCGTTG CGCACCGCCG CCCCATGATC GGTATGTAGG GTTGACCGGA
CGCATCTATT GCCGGCTCAC CGCTACCACC CCGATCTTTG TCGCGGATGG TGAGAACGTG
CGCGAAGAGC ACGTCAACGG CAAGATACAT CGTCACTATC GCTTCTTTCG CGATCCGGAG
AACAAGGTCG CTATTCCCGG TACGAGTCTG CGCGGTGCAA TCCGCGCCAT CTTTGAAGCA
GCGACCAATT CCTGCTTTGC CCACTTTGCC GGCGATAAGC GATTAAGTTA TCACCTGTTG
CCAGAGCTGG CGCTGCAACT GGTTCCAGCG CGGGTGCGCA AAATCAACAC GCGGTGGGAA
TTGGAGTTGT TACCCGGAAC AACAACGATC ACACCCGGCC AACGACCTGC CGGTCCGCAG
TACGCTGCAT GGGTGCATGT GTACGATCCG TTGCAGAAGA GCAAAACGGT CGCTCAGTCG
CTCAGTAAAC CTTACGCCCA GCGTCAAAAA CTCTCGCTGA CCGGCTTTGC GCATGGTGAG
TTGTGTCACG CGATTATCGA GCGTATGATC CATCCGCGCC GCAATTTCAA GTTCTGGAAT
GTGGTGCATC TGGCAAAATC GGCTCAATCA CTGCCGGAGC CCGGTGCTAA TCAGCTAAAG
GTGAGCGGGT ATCTCTGTAT TACCAATCAG AACATTGAGA ATAAACACGA TGAGCGACTG
TTCTTCACCA ATCAGAAGCT CTCTCCGCTT GATTTGCCTG ACACGGTGCG GCAGAAGTAC
GAAGAACTGA TCGCCGACTA TCAAGAGCGT CACCGCGATG AGGTGCGCAA ACGGCGGGAT
CCAAATCGTC CGCAGGGCAC AGAGCCGGCC TTCAGCCGTT TTATTGTCGA AGGCCGCCAA
AAACTAACAG ATGGTGATCT GGTCTATGCC ATGCTCGACA AAGTTGGCAA CAGTGGATAC
AAGGTGCGAT TCATTGTGCC CGTCTCGGTG CCGCGTGTCG GCTTCGAGCG CACCATCGGC
GATCTGCTGT ACCCGTCTGA CCTGAAGAAA TGCGCGACTT ACGATGCGCT CTGCCCTGCG
TGTCGGGTGT TTGGGTGGGT GTGGGGTGAC GAAACTGCCG TTAATCCTCC AGAACTGTCA
GTACGAACGG CGTATGCCGG GCGCGTTAGC TTCAGCCATG CCGTATTGAC CAAAGATGGC
GGAACGTTTG ATGAAACTCT GGCTATTTTG TCTACACCCA AGCCGACCAC CTATCGCTTC
TATTTGCGTC CACGCACCGG CAAACCACAA AATGGACAAG ACGATAGACA AGTAGATTAC
AACAACCAGA ATCAGATCCT GCGCGGGCGC AAGATCTATC GGCATCATGG CGCGCGGCTC
AATCCGCAGG AATACCGGAG TGTTAACGGT GTGAAGAGCG ATCAGAACCG CACGGTGCGT
GGCGTTCAGC AGGCGGGCAG CATCTTTGAG TTCACTGTGG ATTTCGAGAA CCTGGCGCCG
CTGGAGCTTG GCGCACTGCT CTGGAGTTTG CAGATCGAAG GATGGCATCA CCGCATCGGC
TACGCCAAAC CGCTGGGTTT TGGCTCGGCG AAAATTGACA TCGTGAAGAT TTCCTTGCTG
CACCCTGAAG CGCGATACGC CTCGTTCACG AGTAGCGGCT GGCACGATCA GGATCAGCAA
AAAATCGATG CATGGATCAA GGAATTCAAG CAGGCAATGA AGTTACGCTT CGGCGCCGCC
TTCGAGATGT TGGCAAACAT CTGCGACCTC AAGGCATTAC TGGCTGACAC ACCTCCCCTG
CCGGTCCACT ATCCGCGTCC GACCCGCCAA CCGCAGCCGG ATGGCAAACA GTACGAATGG
TTTGTCGGCA ACAAGCGGGG CGGTAACAAT CCCGGACCGC GTATCGCATT GCCACTGGCG
GAGGATGATG TGGCGGGTTT ACCCCTGATT GATAAAAATG GCAATATAAT CCCGTGA
 
Protein sequence
MSKSKTQRRS ISSGTSSAQG YRFLNPYNFV RTLETRNAHI ARLLGRCAPP PHDRYVGLTG 
RIYCRLTATT PIFVADGENV REEHVNGKIH RHYRFFRDPE NKVAIPGTSL RGAIRAIFEA
ATNSCFAHFA GDKRLSYHLL PELALQLVPA RVRKINTRWE LELLPGTTTI TPGQRPAGPQ
YAAWVHVYDP LQKSKTVAQS LSKPYAQRQK LSLTGFAHGE LCHAIIERMI HPRRNFKFWN
VVHLAKSAQS LPEPGANQLK VSGYLCITNQ NIENKHDERL FFTNQKLSPL DLPDTVRQKY
EELIADYQER HRDEVRKRRD PNRPQGTEPA FSRFIVEGRQ KLTDGDLVYA MLDKVGNSGY
KVRFIVPVSV PRVGFERTIG DLLYPSDLKK CATYDALCPA CRVFGWVWGD ETAVNPPELS
VRTAYAGRVS FSHAVLTKDG GTFDETLAIL STPKPTTYRF YLRPRTGKPQ NGQDDRQVDY
NNQNQILRGR KIYRHHGARL NPQEYRSVNG VKSDQNRTVR GVQQAGSIFE FTVDFENLAP
LELGALLWSL QIEGWHHRIG YAKPLGFGSA KIDIVKISLL HPEARYASFT SSGWHDQDQQ
KIDAWIKEFK QAMKLRFGAA FEMLANICDL KALLADTPPL PVHYPRPTRQ PQPDGKQYEW
FVGNKRGGNN PGPRIALPLA EDDVAGLPLI DKNGNIIP
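
The protein sequence above corresponds to the gene sequence read codon by codon. The following is a minimal sketch of that translation, not the pipeline that produced this record. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons). The names `CODON_TABLE`, `translate`, and `first_line` are illustrative.

```python
# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(blocked_dna):
    """Translate DNA written in space-separated blocks, stopping at a stop codon."""
    # Remove the spaces and line breaks used for display in the record.
    dna = "".join(blocked_dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence as shown in this record.
first_line = "ATGAGCAAAT CTAAAACCCA ACGCCGGTCT ATATCGAGCG GTACCTCATC TGCACAAGGA"
print(translate(first_line))  # MSKSKTQRRSISSGTSSAQG
```

The output matches the first 20 residues of the protein sequence listed above.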