Gene Cagg_2943 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2943 
Symbol 
ID7268816 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp3605763 
End bp3606734 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content58% 
IMG OID643567765 
Productpeptidase S58 DmpA 
Protein accessionYP_002464239 
Protein GI219849806 
COG category[E] Amino acid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3191] L-aminopeptidase/D-esterase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.141829 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGTAC CGGCTATTAC CGATGTGAAA GGCATATTGG TCGGTCACGC TACCGATACT 
GCTGCATTGA CCGGTGTGAC GGTTGTGTTG ACACCAGATG GTGCAACCGC AGGTGTTGAT
GTGCGCGGTG GTGCTCCTGG CACCCGTGAG ACGGATCTGC TCAATCCGGT GAATCTGGTT
CAGCAGGTTA ATGCGGTGGT GTTGACAGGT GGCAGTGCGT TTGGTTTGGC CGCCGCCACA
GGAGTGGTGT CATGGCTCTA CGAGCGGGGA TACGGTTTTG ATGTTGGCCT CACCAAAGTG
CCAATTGTCC CGGCAGCAGT GATCTTCGAT CTCGGTATCG GACGTGCTGA TGTATGGCCC
GATGCGACAA TGGGCTATGC AGCTTGTGTT GCGGCTGATG CGAGTGTGGC TGAAGGTAGT
GTTGGCGCCG GGATCGGAGC AACTGTGGGT AAAGTTGGTG GTATGAACAC GGCAATGCGG
GGTGGTGTAG GGACGTGGAG TGAAACGCTT GCCGATGGGG TAACGGTTGG CGCATTGGTG
GTCGTGAATG CGTTTGGTGA TGTGGTTGAT CCACAAGGAC GGATCATCGC CGGCGCACGC
GGCCCGGGTC AAACGTTGGT AGGTACCGGC GCGTTGTTGC GTGGAGGGTT GAGCCGACAA
TCATTCGCTG ATACCACCGG CCAACATACG ACGATTGCGG TTGTGGCAAC CAATGCTCGA
TTGGATAAAG CTGCGGCAAC CCGCCTGGCG ATTATGGCAC AGGATGGGCT GGCCCGCGCT
ATTCGTCCGG CTCATTCGCC GTTTGATGGT GATACCGTGT TTGCCCTTTC TACCGATGTC
TACGAAGCTC CCCCTTTGGT AACGCTCGGT GCTGTAGCTG CCGATGTGCT TGCGATCGCG
ATTGTGCGTG CTGTCCAAGC TGCAACGACG GTAGCCGGTG TTCCGGCAGC GCGTGATGTG
CCGTCAGCCT GA
 
Protein sequence
MKVPAITDVK GILVGHATDT AALTGVTVVL TPDGATAGVD VRGGAPGTRE TDLLNPVNLV 
QQVNAVVLTG GSAFGLAAAT GVVSWLYERG YGFDVGLTKV PIVPAAVIFD LGIGRADVWP
DATMGYAACV AADASVAEGS VGAGIGATVG KVGGMNTAMR GGVGTWSETL ADGVTVGALV
VVNAFGDVVD PQGRIIAGAR GPGQTLVGTG ALLRGGLSRQ SFADTTGQHT TIAVVATNAR
LDKAAATRLA IMAQDGLARA IRPAHSPFDG DTVFALSTDV YEAPPLVTLG AVAADVLAIA
IVRAVQAATT VAGVPAARDV PSA