Gene Cagg_0803 details


Gene Information

Locus tag: Cagg_0803
Symbol:
ID: 7268122
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 998031
End bp: 999677
Gene Length: 1647 bp
Protein Length: 548 aa
Translation table: 11
GC content: 53%
IMG OID: 643565654
Product: glycosyl hydrolase BNR repeat-containing glycosyl hydrolase
Protein accession: YP_002462163
Protein GI: 219847730
COG category:
COG ID:
TIGRFAM ID:
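
The coordinate and length fields in this record agree with one another, which a few lines of arithmetic can confirm. The sketch below is illustrative only. It assumes that the start and end positions are 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(998031, 999677)
print(length_bp)                  # 1647
print(protein_length(length_bp))  # 548
```

Both values match the reported gene length (1647 bp) and protein length (548 aa).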


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACGTC GGCTCAGCGC ATTCCTTGCC CTGATCTGCT ACCTGATCGG CTTCAACGTT 
ACACAACTAT CACACGGCCA GAGTATCTGG TCGTTACCGA TTGAGTTGTC ACCGCTGCAA
TACGGCCAAC GACCGCTTGA GCAACTCGAA CGACCGTATG GCTGGTCATG GTTGCCCGAT
ATGACACTCG GCCCTGACGG TAGCGTGCAT GTGGTATGGT ACGGGGGCCT GATTAAGGAC
CAAGGTAATG AAGGTACGGT TGATCTATTG ATGTATCGCC GTCGTAACGC CGATGGCTCG
TGGGAGCCGG TGCGTGAATT GTTCGCACCA GGCGAAGGTG GTTACACGGT TCGCACGAGC
ATAACCCTTG GGCGCGACGG GAATCTCCAT CTCCTTTACC GAGCCGGAAC ACGCATCTTG
TACACGAACG CGAACTGGCG TGGCGCCATC CAACCACATG CGTGGCAACC TGAGAGGGTA
ATCAGCGATA GTGGTTACTA TGTTGCGCTC GCCGCCGATC AGACCGGAGG GTTGCACGCT
TTTTGGAGTG ATATTGTTAC CGAGAACACC AACCCACACT GTTATCGGTG TGGTGAACTC
TTTTACCGAC GCTCGACCGA CAATGGTGTC ACGTGGTCAC CGGTTGTTAA TCTCTCGCGT
ACCGACGAAG GTGATAATCG TCCACAGGTA CGGATTGATA GGTTCAACCG TATTCACATT
GTTTGGGATG TCGGGGCCGA TTGGTACGCC GGGCAAGGAC AACCCCACTA TGGTATGTAC
CGACGTTCGG ATGATGGGGG GCTGACATGG AGCGAACCGG TGCGATTCAG CTTACCACCG
GCTGTGGTAC AAGAAATTCG CCAGCAGCAA AATCAGGTGA CGACAGGCAA TGATGCGCAG
AAACCGCCTT TTGAGGCGGT CCAGCAGACG GCATTGGCGG TTGATGAAGC CGGTAATCCA
TTTGTCGTCT ATCGTGGCGT CCACAACGAT CGTCTCTACT TTCAGCGTTC GCTCGATGGA
GGCAATACGT GGACACCGGC GAGTGAGCTG CCCTATGTGC GGGCGCGTAA TATCACCGAC
AATAACCTTG ATTATTACAG TCTTGCGGCT GATAGCGCCA ATAACATCCA TTTATTAATG
GTGGGGTTTG TAGGAACCAG CACGACCGAT ACCCCACCGG CCCTGATTCA TATGACGTTC
GACGGCACAC GATGGTTATC GCCGCGAATC GTGATGCAGA ACGAGCTATA CCCTGAATTG
CCACGACTGG CGATCTACAA TGGCAATCAA CTGCACGCTG TCTGGTTTAC GCGGTCGAGT
TTGTTTGAAG CTAAGAAGTC GAATAAACGG CCGGTCTATC AAATTTGGTA TAGCACTGCA
CAACTCAACC TACCGGCCCA ACCCGGTATT CCGCTCTTTA CTCCAACACC GGTCACAACC
ACACCAACCG CTGTGGCCGG CGTTGTAGTT ATGCCAACGG CCACCCCTAT TGTGTTACCC
GATGAGATAC GTCACGCACC TGCCTTGCAA GAACCAATGC GCTGGGAGTT GTATGGTTTA
CAGGCGATCG GTATTGCTTT GATATTGACC ATCATCGGTA TTGGCGTCAT CGGTGGACTA
ATCATGATCA GGCGGTCACA CCGATAG
 
Protein sequence
MKRRLSAFLA LICYLIGFNV TQLSHGQSIW SLPIELSPLQ YGQRPLEQLE RPYGWSWLPD 
MTLGPDGSVH VVWYGGLIKD QGNEGTVDLL MYRRRNADGS WEPVRELFAP GEGGYTVRTS
ITLGRDGNLH LLYRAGTRIL YTNANWRGAI QPHAWQPERV ISDSGYYVAL AADQTGGLHA
FWSDIVTENT NPHCYRCGEL FYRRSTDNGV TWSPVVNLSR TDEGDNRPQV RIDRFNRIHI
VWDVGADWYA GQGQPHYGMY RRSDDGGLTW SEPVRFSLPP AVVQEIRQQQ NQVTTGNDAQ
KPPFEAVQQT ALAVDEAGNP FVVYRGVHND RLYFQRSLDG GNTWTPASEL PYVRARNITD
NNLDYYSLAA DSANNIHLLM VGFVGTSTTD TPPALIHMTF DGTRWLSPRI VMQNELYPEL
PRLAIYNGNQ LHAVWFTRSS LFEAKKSNKR PVYQIWYSTA QLNLPAQPGI PLFTPTPVTT
TPTAVAGVVV MPTATPIVLP DEIRHAPALQ EPMRWELYGL QAIGIALILT IIGIGVIGGL
IMIRRSHR
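
The sequences above are printed in space-separated blocks of ten characters, so they need light cleaning before use. The following is a minimal sketch of how such a listing might be normalized and sanity-checked. It assumes the standard stop codons (TAA, TAG, TGA), which translation table 11 uses. The helper names are hypothetical, and the demonstration uses only the first and last lines of the gene sequence rather than the full listing.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons under translation table 11


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a blocked listing and upper-case it."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence, copied from the listing above.
first_line = clean("ATGAAACGTC GGCTCAGCGC ATTCCTTGCC CTGATCTGCT ACCTGATCGG CTTCAACGTT")
last_line = clean("ATCATGATCA GGCGGTCACA CCGATAG")

print(first_line.startswith("ATG"))   # True: the CDS opens with a start codon
print(last_line[-3:] in STOP_CODONS)  # True: the CDS closes with TAG
print(f"{gc_fraction(first_line):.2f}")  # GC fraction of the first line only
```

Applying `clean` and `gc_fraction` to the complete gene sequence gives a value that can be compared with the GC content reported in the record.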