Gene Cagg_2781 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2781 
Symbol 
ID7269851 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp3418308 
End bp3419678 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content56% 
IMG OID643567602 
Productbeta-galactosidase 
Protein accessionYP_002464080 
Protein GI219849647 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID[TIGR03356] beta-galactosidase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCATG CAATCCGCTT TCCGACGAAC TTTATCTGGG GCGCAGCCAC CGCTGCCTAT 
CAGATTGAAG GTGCCTGGAA CGAAGATGGA AAAGGCGAGA GTATTTGGGA TCGCTTTGTC
CGCCGACCCG GTGCCATTGC CGATGGTAGT ACCGGTGATG TCGCCTGTGA CCACTATCAC
CGGTACGAAG AAGACCTCGA ACATATGGCA GCAATGGGAC TGAAGGCGTA CCGTTTCAGC
ATCGCATGGC CGCGCATCTT CCCCGACGGC ACCGGCCAAC CCAATCAGCG CGGGCTTGAT
TTTTATCGCC GACTCATTGA CGGTTTGCAC CGACGTAGGA TTCTCCCGGT TGCAACTCTC
TACCATTGGG ATCTGCCGCA AGCAATTGAA GATCGCGGCG GCTGGATCAA CCGAGATACG
GCTTTTTATT TTGCCGAATA CGCCGATTAT CTCTTTCGCC AGATCGGTGG CGATGTTGCG
CTCTGGGCTA CCCACAACGA GCCATTTATA CAGGCCTTCT ACGGCTACGG CAATGGTGAA
AATGCGCCCG GTAAGCGAGT GCCGTGGCGA GTATTGCACG TCGTACACCA TCTTTTGTTG
TCACACGGGC TGGCAGTGAG CGCTTTTCGC GCCACCAAGC CGCAACCGGT ACGCGCCGAT
CTACCATCAC CCCAGATCGG GATTGTCCTT ATGATCTGGC CGCAGTATCC GGCCTCTGAT
CATCCTGCTG ATCTTAAAGC TGCTCAGCGC ATCGACGGAG CAATGAACCG GCTCTTCCTC
GAACCGCTGT TCCGCCGGCG CTACCCCGCC GATCTAGTAG CACACTTTGC TCGTCGGCTC
ATCTTCGCGC CGGTCAAGCC CGGTGATATG GAGATTATCG GCCAGCCGAT CGATTTTCTC
GGTATTAACA CCTACACGCG GCTCTTCAAT GCGGTGAACT GGCGCGAACC GTTTTTAATG
ACTAAGCAGG TGCCGGGGCC GCTCCCCAAA ACGGCGATGG GCTGGGAGAT ATACCCCGAT
TGTATCGTCG AGGCGTTGCA GAAGGCACGT GAGTATACGT CGCTTCCGCT GTACATCACC
GAAAACGGCG CAGCGTTCGA CGATCCGCCA CCCGGCCCCA ACGATCAGAT CGTTGAAGAC
CCGGATCGTG TTGCTTACCT CCGCTCTCAC ATTGCGGCTT GCCATCGCGC ACTGACCGCC
GGGATCGATC TACGCGGTTA TTTTGTCTGG ACACTGATGG ACAATTTTGA GTGGGCGAAA
GGGCTGAGCA AGCGGTTTGG GATTATTTAT ACCGATTATG CCACTCAACG CCGGGTGTGG
AAACGCAGCG CGCACGTGTA CCGCGACATT ATCGCGCGCA ATGGGTTGTG A
 
Protein sequence
MNHAIRFPTN FIWGAATAAY QIEGAWNEDG KGESIWDRFV RRPGAIADGS TGDVACDHYH 
RYEEDLEHMA AMGLKAYRFS IAWPRIFPDG TGQPNQRGLD FYRRLIDGLH RRRILPVATL
YHWDLPQAIE DRGGWINRDT AFYFAEYADY LFRQIGGDVA LWATHNEPFI QAFYGYGNGE
NAPGKRVPWR VLHVVHHLLL SHGLAVSAFR ATKPQPVRAD LPSPQIGIVL MIWPQYPASD
HPADLKAAQR IDGAMNRLFL EPLFRRRYPA DLVAHFARRL IFAPVKPGDM EIIGQPIDFL
GINTYTRLFN AVNWREPFLM TKQVPGPLPK TAMGWEIYPD CIVEALQKAR EYTSLPLYIT
ENGAAFDDPP PGPNDQIVED PDRVAYLRSH IAACHRALTA GIDLRGYFVW TLMDNFEWAK
GLSKRFGIIY TDYATQRRVW KRSAHVYRDI IARNGL