Gene Cagg_1016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1016 
Symbol 
ID7268388 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1254659 
End bp1256107 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content56% 
IMG OID643565862 
Productleucyl aminopeptidase 
Protein accessionYP_002462367 
Protein GI219847934 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0260] Leucyl aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAATGCG AAGTTCAGAG CGGTGAACTG TTGGCTACCG AGACGGAGTT AGCCGTTTTG 
CTGTACGCCG AGGACGAAAC ATTACCTGCG GAAGTTACTG CGTTGTGCGA ACCTAACGAT
GCCAACGGAC GCTGGAAGCA GCAAACGTTA CTCTACCCAC GCGGAGCGTT GCCGGCGCGA
CGGTTGTTGC TGATCGGGAT GGGGAAGCGT TCGGGTATTA CTGCCGATAC AGTGCGGCAA
GCGGCAGCAA TTGCGGCCCA ACGGGCGCAA GAATTGAAGG TGTCGAGGTA TCATCTGGGC
TATAATGGTC ATCTGCCATT GACACCACAA CAATTTGGTG TAGCTTTTGC CGAGGGTAGC
ATCTTGGGGT CGTACCGATA TACACGCTAT AAGAGTGACC GGGAGGAGAC GCCGGAGATT
ACGGCCATCT TGCAAGCTGG TTCTGATGCG CCGGAAGCCG TTGAGGGGGT ACGGCGTGGT
CAGATACTGG CGCAGGCGAC GACGTTTGCC CGCGATCTCG CGAACGGTCC CGGTAATGAG
GTGACACCGG CATTTCTAGG TCAGACTGCT GTTGAGATGG GTGCTCGCCT TGGTTTGCAG
GTCACCGTGC TGGATAAAGC ACAATTGATC GAACAAGGTT TTGGCGGTAT TCTCGCCGTT
GGGCAAGGAT CGGCCAACGA ACCACGCTTT ATCGTGATGG AATATGGTAC TGCCGGGCGC
GGTCCAACCA TTTGTCTCGT CGGTAAGGGT ATAACCTTCG ATACCGGAGG TATCAGTATC
AAACCGGCCG AGAAGATGGA CGATATGAAG ATGGATATGA GTGGTGCGGC GGCAGTTTTC
GGAGCTATGC AAGCGGTAGC CGAATTGCAG TTACCGTTGC ACGTGGTTGG TATTGTCTGT
GCTGCCGAGA ATATGCCGAG CGGTACCGCT TATCGTCCCG GCGACATTAT TCGCACCCTC
AGTGGTAAGA CGGTTGAGGT GCTTAATACC GATGCTGAAG GTCGGATCGT GTTGGCAGAC
GGCCTCTTTT ACGCACAACG CTATCAGCCG GCGGCGATTG TTGATTTGGC AACCCTGACC
GGTGCGATCA TGGTGGCCCT TGGTTCGCAT GCGATCGGTT TAATGGGGAA CAACCAGGAA
TTGGCAAATC GGCTCATTAC TGCCGGTGAA GCGACTGCCG AGCGCGTTTG GCAGTTGCCG
CTCTGGGAGG AGTACCGTGA TGCGATGAAG AGTGACATTG CCGATCTCAA AAATACCGGT
GGGCGTTACG GCGGCGCGAT TACGGCTGCC GGTTTTCTGG CCGCTTTTGT CGGTGATTAT
CCGTGGGCGC ACCTCGATAT TGCCGGTACT GCGTGGGTTG AGAAGCCGTC GCGCGCTTAT
CAATCACGTG GGGCGACCGG GGTTGGTGTA CGGTTGTTGG TTGAGCTGTT ACAGGGATAT
GTGAGTTGA
 
Protein sequence
MQCEVQSGEL LATETELAVL LYAEDETLPA EVTALCEPND ANGRWKQQTL LYPRGALPAR 
RLLLIGMGKR SGITADTVRQ AAAIAAQRAQ ELKVSRYHLG YNGHLPLTPQ QFGVAFAEGS
ILGSYRYTRY KSDREETPEI TAILQAGSDA PEAVEGVRRG QILAQATTFA RDLANGPGNE
VTPAFLGQTA VEMGARLGLQ VTVLDKAQLI EQGFGGILAV GQGSANEPRF IVMEYGTAGR
GPTICLVGKG ITFDTGGISI KPAEKMDDMK MDMSGAAAVF GAMQAVAELQ LPLHVVGIVC
AAENMPSGTA YRPGDIIRTL SGKTVEVLNT DAEGRIVLAD GLFYAQRYQP AAIVDLATLT
GAIMVALGSH AIGLMGNNQE LANRLITAGE ATAERVWQLP LWEEYRDAMK SDIADLKNTG
GRYGGAITAA GFLAAFVGDY PWAHLDIAGT AWVEKPSRAY QSRGATGVGV RLLVELLQGY
VS