Gene Cagg_1901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1901 
Symbol 
ID7266392 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2332322 
End bp2333533 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content57% 
IMG OID643566738 
Productpeptidase M20 
Protein accessionYP_002463232 
Protein GI219848799 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.296335 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.958689 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGTTGA TGCCGCATAG CGAGGGACGG AAAGGAGCGA TGATGAAAGC GGTCGTTCAG 
CATTGGTATA ATCATCCGGT AGTGCGGGAT GCGCTGGCCG ATTTGGCCGA TTATCGGCCG
ACACTGGCGA TGGCGATTGA GATTCAACAG GTACCGGCAC CCACGTTTGC TGAACGACCG
CGATCATTGC TCGTTGCCCA GCGGATGCAG GCACTCGGTT TGCACGATGT GACGATTGAT
GAGTTGGGGA ATGTGTACGC TCGTCGGCCC GGTCACGCCG ATCGCCCCGC TTTGTTGGTA
TCGGCACACC TCGATACGGT CTTTCCTGCT GATACCGACC TTTCCATCCG CTACGAAGGT
GAGCGCGTCT ATGGCCCCGG TATTGGCGAC AATAGCGCCG GGGTAGCCGG CTTGCTGCGG
GTGGCCGAGG TGTACCAGCG CTTTAATCTC CCAACTGCCG GTGATATTTG GTTTGTGGCA
AATGTTGGTG AAGAAGGGTT GGGTGATCTG CGCGGGATGC GCGCCGTGGT CGAACGGCTG
CGCTCGCAGC TCGGTGCAGT CGTCGTGATC GAGGGCTGTG ACTTCGGCTC ACTCCACCAT
CAAGCGATTG GAGTGCGTCG CTTTCGGATC GACGTTACCG GTCCCGGTGG CCATTCATGG
GGTAATTTCG GCACCCCAAG TGCGATCCAC GTGTTAGTCC GGTTGGCGGC ACGTTTGACG
GAATTGCATG TACCGTTGTC ACCGCGCACA ACGTTTAATA TCGGCACCAT CAGCGGCGGT
ACGTCGGTTA ATACGATTGC CCAACATGCC AGTATGTTGC TCGATCTGCG TTCGGTATCG
TCGGCGACGC TGACCGAGCT GGTGAACGAG GTCTATCGGT TGGTCGAGGA GGCTGCGCTC
GAATACCCCG ACATCCACGT GCAATTAGTG AAAGTCGGTG ACCGACCTTC CGGTGCTATT
CCGCGTGAAC ATCCACTGGT ACAAGCTGCC GTTGCGGCCT ACCAGATGGT TGGTGCGCAG
GTTTCGTTCC AGCAGAGTAG TACTGATGCG AATATTCCAC TCAGCATGGG TATTCCGGCG
GTGTGTGTTG GCCTAACCGA TGGTGGTAAT GCGCATCGCA CCGATGAATA TATCATACCG
GTGAATCTGA GTCGCGGGTT ACAGGCATTG TTGTTGTTGC TGCTGGCGGC TGAAGCCATT
GAAGATAGGT GA
 
Protein sequence
MLLMPHSEGR KGAMMKAVVQ HWYNHPVVRD ALADLADYRP TLAMAIEIQQ VPAPTFAERP 
RSLLVAQRMQ ALGLHDVTID ELGNVYARRP GHADRPALLV SAHLDTVFPA DTDLSIRYEG
ERVYGPGIGD NSAGVAGLLR VAEVYQRFNL PTAGDIWFVA NVGEEGLGDL RGMRAVVERL
RSQLGAVVVI EGCDFGSLHH QAIGVRRFRI DVTGPGGHSW GNFGTPSAIH VLVRLAARLT
ELHVPLSPRT TFNIGTISGG TSVNTIAQHA SMLLDLRSVS SATLTELVNE VYRLVEEAAL
EYPDIHVQLV KVGDRPSGAI PREHPLVQAA VAAYQMVGAQ VSFQQSSTDA NIPLSMGIPA
VCVGLTDGGN AHRTDEYIIP VNLSRGLQAL LLLLLAAEAI EDR