Gene Cagg_1408 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1408 
Symbol 
ID7269240 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1733468 
End bp1734904 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content58% 
IMG OID643566251 
ProductPeptidoglycan-binding domain 1 protein 
Protein accessionYP_002462751 
Protein GI219848318 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACAAC GTTGGTTGAG CTGGTGGATA GCCCTCATCC TCCTCGCGTC CTGCACATCG 
GCTTCACCCA CGCCAGTACC GTTGTCTACG GCTACACCAT CGGGAGAGCG GCCAACTCGA
ACACCTCACC CCCAACCGAT CACGCCCACG GCAGCTCAAC CAACAGCAAC ACCGATGACG
AATCTACCGG CGCCGGTGTA TCTGCTTCAG AACGATCAGG TGATGCGGCT CGCTCCCAAC
GGTCAACAGC TCACCCAGAT CACTTACGAA CTACAGCCGG TACGCGAACT GAGCGTTGCG
AATAACGGCA CCCTCGTCTA CGTGACCGAA AACGACTTAA TAGCCCTCGA CGGTACGGGC
CGCCGGGTTG TGATCGATGC ACAAACCATT AGCCGCCCGC GGATTTCGCC CGACGGCCAG
CGTGTTGTCT ACCGTCTGGA CGATCCGACA CCCGATCTGC TCGGTACAGA CGCGCCCAGT
GGCGTTTACA TCAGCGAGCT ACGTGGCGGA CAACCACGAC TATTACAGGC CGACGATCCC
GAACCGGTCA CACCGGATTT CTCCAAGCCG GCGTGGCGCT ACATACCGGT GTCATGGTCT
CCCGATGGTC AGCGGTTGCT ATTATACGCC GTGATGCAAC CGGAAATTGG CATTCCCGGC
GGTGAAGTGG TCATTATCGG GCCAGACGAT CGGGTTGTCC GTGCCTTCAG TTGTTGTGAA
GAAGAGATAT GGAGTGTTAA TGGAGACGAA TTGACCGTGG CTGGCGGTGG ACCCGATCCT
GATCTACGTT TCGGTCTGTA TCGGATCGAT GTCGAGCATG GCACCGAATC GCCGGTAGTA
GCAAGGAGTG AGGCGATCAT CCCCCTCGTG CGCGCACCAC AACGGATGGC GGATGGAGAC
ATCTATGCCT TTATCGAGCT GGTGCCAGTA CAAGCGTATC AATGGGATTA TCCTTTTCGC
CCCCAGATGG TACGTGTCGG CAATGATGGC ATCATAACGC CAATACGGCC TGAACGGCTT
GGCGAACCGC TGTTGAACTT GTGGAACCCA CAAGCACGCG GTGCTTTGGT TCAATTCGCC
GAACAAGCGA ACCTGATCTG GTTACCAACC GATCCCACCC TACCGATTGT CACAACCGTC
GCCAACGGTG TTGCCGCTGC GTGGACACCT ACAGCCGACT TGAGCATCCG CCCTTGTGAC
GGCTTTGCGA CCATTTCACC CCAACCCGCC AACCACCGTC AGTTCGATCC GGCGGTTGCC
GATCTCCAAG GACGGCTCGC CACTCTCGGC TTCGATCCCG GACCCATTGA CGGTCTATTC
GGCCCGACTA CTGCTACCGC CGTGCAGGCT TTTCGCATTG CTACCGGTTT GCCGGCAGGC
GACAGTATTG ATTGCGTAGC CTGGCAAAGC CTCTTAACCC GGAGTACTGC TCAATGA
 
Protein sequence
MKQRWLSWWI ALILLASCTS ASPTPVPLST ATPSGERPTR TPHPQPITPT AAQPTATPMT 
NLPAPVYLLQ NDQVMRLAPN GQQLTQITYE LQPVRELSVA NNGTLVYVTE NDLIALDGTG
RRVVIDAQTI SRPRISPDGQ RVVYRLDDPT PDLLGTDAPS GVYISELRGG QPRLLQADDP
EPVTPDFSKP AWRYIPVSWS PDGQRLLLYA VMQPEIGIPG GEVVIIGPDD RVVRAFSCCE
EEIWSVNGDE LTVAGGGPDP DLRFGLYRID VEHGTESPVV ARSEAIIPLV RAPQRMADGD
IYAFIELVPV QAYQWDYPFR PQMVRVGNDG IITPIRPERL GEPLLNLWNP QARGALVQFA
EQANLIWLPT DPTLPIVTTV ANGVAAAWTP TADLSIRPCD GFATISPQPA NHRQFDPAVA
DLQGRLATLG FDPGPIDGLF GPTTATAVQA FRIATGLPAG DSIDCVAWQS LLTRSTAQ