Gene Cagg_1344 details

Gene Information

Locus tag: Cagg_1344
Symbol:
ID: 7268636
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1663925
End bp: 1665814
Gene Length: 1890 bp
Protein Length: 629 aa
Translation table: 11
GC content: 59%
IMG OID: 643566187
Product: peptidase S9 prolyl oligopeptidase active site domain protein
Protein accession: YP_002462687
Protein GI: 219848254
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
TIGRFAM ID:
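
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; neither convention is stated in the record itself. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1663925, 1665814)
print(length, protein_length(length))  # prints: 1890 629
```

Under these assumptions the computed values agree with the Gene Length (1890 bp) and Protein Length (629 aa) fields.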


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000317617
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCCTCAGA TCGCACCCTA CGGCAGTTGG CGCTCGCCGA TCACCGCCGC GATGCTCGTC 
ACCAAACAGG TAAGCCTCAG TTGGCCGCAA ATTGATGGTT CTGACCTCTA CTGGATCGAA
GGCCGACCAC AAGAGGGTGG TCGTAATGTG CTCGTTCGAC GCACTGCTGC CGGCACGATT
GACGATGTGA CGCCGACCGG GATGAATGTA CGCACGTTAG TACACGAATA CGGTGGTCGG
AGTTATGTCG TTGATCAAGG TGAGATCTTC TTCGTGGAGT TCAGCGATCA GACGCTTTAC
CGCCAACACC CCGGCGCCGC ACCGGTTCCG CTTGGCACAC CGCCCGGGCT GCGGTTCGCC
GAGCCGATAG TTGATCGCCG ACGTAACCGG TTGATCGCCG TTGGCGAAGA CCACCGGAGC
AGTGGTGAAC CGCTCAACAC GCTCGTCGCT ATCAGTCTGG CCGATGGGAC AGTCACGCCA
TTACTGAATG GACGCGATTT TGTGGCTGCA CCACGCCTCA GCCCTAACGG TGCGTGGCTG
GCCTGGTTAG CGTGGGATCA TCCCAATATG CCGTGGGACG CTGCTGAGTT ATGGGTAGCG
CCGGTGCTGA CCGATGGCAG TCTAGGTGCC GCCCGTCGTT TAGCCGGTGG TCCCGGCGAC
TCGTGCTTCC AGCCGGAATG GACGCCCGAT GGCGCATTAC TGGTCGTCGC CGAACGTACC
GGCTGGTGGA ACCTCTACCG CCTCGACCTC GATGGCAACA CCACCCCACT CTACCCGCTT
GATGCCGAGT TTGGTCAACC ACTCTGGCAA CTGGGTATGC GCACGTATGT TCCACTTCCC
GACGGACGCA TCGTAGCGAC CTTCAGCCGC GAGAGTCGCC GGCACATGTG CGTGATCGGG
CAGCCGGGTC ACGCCGAACC CATCGAGTTA CCGGTTTCGG TAATCAATAT CGTCAACGGT
GACGGCGAAC GGATCGTCTT TGTCGGCAGT TCGCCGACGA TGCCGGCAAC GCTCTTCTTG
CTTAACCTCG CCGATCGATC GCTGACACCG ATCCGCAGCA GTGGCGATGT GCCGGTCGAT
CTATCGTACA TCTCACAGCC AGAGGTGATC AGCTTTCCGA GTGCCGGTGG GCGGATTGCG
CACGGCATTT TCTATCCACC TCACAACCCC GATTTCAGCG CACCTGACGG TGAACTACCC
CCATTGCTGG TAATGATCCA CGGTGGGCCA ACCGCTGCCA CTTATCCCAC CTTACGCCTC
TCGATCCAAT ACTGGACGAG TCGGGGGATC GGTGTACTCG ATGTGAACTA CGGCGGCAGT
ACCGGTTTTG GTCGTACTTA TCGTGAACTG CTCGACGGAC AGTGGGGTGT GGTTGATGTA
GAAGATTGTG TTGCCGGTGC CCGGTTTCTT GCCGCTGAGG GCAAAGCCGA TCCCAACCGG
CTCTTGATTA CCGGCGGCAG TGCCGGCGGA TTCACAACAT TAGCTGCATT AGCTTTCCAC
AATACCTTCC GCGCCGGCGC CAGTCATTTT GGTGTTGCCG ATCTCGCAGC CCTAGCGCGT
GACACCCACA AATTCGAATC ACGGTATCTC GACCGTCTGA TCGGCCCATA CCCCGCACGA
GCTGACCTCT ATCAGGCACG TTCACCGCTC TACCATGCCG ATCGGATCAA CAGCCCGGTT
ATCTTCTTCC AAGGGCTAGA AGACAAAGTG GTGCCGCCAG ATCAATCGGA GCGTATGTAC
GAAGCGCTTC GCTCACGCGG TATTCGGACC GAATACGTAC CCTTTGCCGG CGAACAGCAC
GGTTTTCGCA AGGCCGAGAA CATTATCACA GCTCTCGAAC GTGAGCTAGC GTTTTACCAA
GAAGTGCTAG GGATCCACGC TGAGCGGTGA
 
Protein sequence
MPQIAPYGSW RSPITAAMLV TKQVSLSWPQ IDGSDLYWIE GRPQEGGRNV LVRRTAAGTI 
DDVTPTGMNV RTLVHEYGGR SYVVDQGEIF FVEFSDQTLY RQHPGAAPVP LGTPPGLRFA
EPIVDRRRNR LIAVGEDHRS SGEPLNTLVA ISLADGTVTP LLNGRDFVAA PRLSPNGAWL
AWLAWDHPNM PWDAAELWVA PVLTDGSLGA ARRLAGGPGD SCFQPEWTPD GALLVVAERT
GWWNLYRLDL DGNTTPLYPL DAEFGQPLWQ LGMRTYVPLP DGRIVATFSR ESRRHMCVIG
QPGHAEPIEL PVSVINIVNG DGERIVFVGS SPTMPATLFL LNLADRSLTP IRSSGDVPVD
LSYISQPEVI SFPSAGGRIA HGIFYPPHNP DFSAPDGELP PLLVMIHGGP TAATYPTLRL
SIQYWTSRGI GVLDVNYGGS TGFGRTYREL LDGQWGVVDV EDCVAGARFL AAEGKADPNR
LLITGGSAGG FTTLAALAFH NTFRAGASHF GVADLAALAR DTHKFESRYL DRLIGPYPAR
ADLYQARSPL YHADRINSPV IFFQGLEDKV VPPDQSERMY EALRSRGIRT EYVPFAGEQH
GFRKAENIIT ALERELAFYQ EVLGIHAER
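
The relationship between the gene sequence and the protein sequence can be checked by translating the DNA codon by codon. The sketch below is a minimal illustration, not the tool the source database uses. It builds the standard codon assignments, which translation table 11 shares (table 11 differs in the set of permitted start codons, not in codon meanings). It assumes the sequence is given on the coding strand, contains only A, C, G and T, and is pasted in 10-base blocks separated by whitespace as shown above. The names `translate`, `gc_fraction` and `clean` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("ATGCCTCAGA TCGCACCCTA CGGCAGTTGG"))  # prints: MPQIAPYGSW
```

Passing the full gene sequence to `translate` and comparing the result with the listed protein sequence (after removing its spacing with `clean`) gives an end-to-end check, and `gc_fraction` on the same input can be compared with the GC content field.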