Gene Cagg_2801 details


Gene Information

Locus tag: Cagg_2801
Symbol:
ID: 7268670
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 3438932
End bp: 3440452
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 56%
IMG OID: 643567622
Product: Peptidase M1 membrane alanine aminopeptidase
Protein accession: YP_002464100
Protein GI: 219849667
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0308] Aminopeptidase N
TIGRFAM ID:
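
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. Neither convention is stated in the record, but both are consistent with the listed values. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3438932
end_bp = 3440452
gene_length_bp = 1521
protein_length_aa = 506

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS of n bases holds n / 3 codons. Assumption: the final codon is the
# stop codon, which is not counted in the protein length.
codons = span // 3
expected_protein_length = codons - 1

print(span, codons, expected_protein_length)
```

Under these assumptions the span equals the listed gene length, and the codon count minus one equals the listed protein length.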


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.184287
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.402753
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTAAGA GTATGATACG CTTCATACTG CTTATTGTCA TCACCCTGAC CGGTTCCGGC 
TCGTGGATAC AACCGGCCAT CTTTCGCGAC GAACCGGCCA TTGCTACCGC GGTCGTCGAT
CCATTCATTA CAGCGCAAAC CATTGCTATG CGGCCGGATT ATGTTACCGA TCTCACCTAT
CAAGACTGGG ATCGCTACAC GATACAGATT CAGCTCGATC CGGCGGCATT ACAACTCAGT
GGCAATCTAA CCGTGCGGCT TACCAATCGC ACAACGGTCG ATTTCGATAC GATCTGGTTT
CATCTCTACC CCAATCACCC CGATTTTGGC GGCAGGCTCG ATGTGACGTC GGCTCGGATT
GATGACATTC CCGTCTCTTC ACGTACCCTC CATAACGACA CACTGATCGG GTTGCAGACC
CCACACCCCA TTGCTCCCGG CCAGAGTACT ACCGTGACCA TGACCTTCAC CGCCCGCACC
CCGCGTAACG CCAGCCGGAA GAGTTTCGGC GCCTATAATC TTGAGGCGGG GGTATGGTCA
ATCGCTTCGT TTTATCCAAT GCTGGCCCGC TACATCGAAG GCATCGGTTG GGACACCCGT
CCAATTGTGT CACGCGGTGA TTTTACGGTG AGTGCGATTG CGCTGTACGA TGTCACGGTT
GAAGCACCGG TAGATTGGCA ATTGGTGACG AGCGGCAGCC AACTCGAACA ACACATACTC
GCCGACGGTC GCCAACGGGT ACGTTTTGTC AGCGGACCAC AGCGCGAGTT CTACCTCGCA
GCCCTACAGG GGCTGGTAGC AACCAGCGCC GACGTTGACG GCACAAGGGT CATTAGTTAC
GTCCAGGCCA ACGATCCTGA CGCCGGTGCC CGTAGTCTGG CCATCGCGAC CACAGCTTTG
CGCATCTTCA ACCAACGATT TGGTGCGTAT CCGTATGCCG AGTTCGAGAT CATCCAAGCG
GCGCTGACCC AATTTTATGG AATGGAATAT CCGGGTGTCG TGCTGATCGA ACAGCGCCTA
TACCAACGCA ACGACCATCT GCTCGAAACG ACTATTGCCC ATGAGATCGG TCATCAATGG
TGGTATGGTC TTGTCGGCAA CGATGCACAA GGTGAAGCCT GGCTCGATGA GGGTTTGGCC
AGTTACAGTC AAATCCTGTA TTATGAGATG ATCGATAACC TCACCCAAGC TCAAGCCGAA
CTAGACGCCT TCCGCGCCGC TTATCGGCGA CTGCGCGAAC GTGGCGGTGA TGCCCCATTA
GCGACACCGC CATCGGAGTT AGGCAACGGC CGCTACGTGC CGGTTGTCTA CGCCAAAGGG
GCACTCTTCT TTCACGCCCT CCGCCAACAG ATCGGCGAAG CAGCATTCAA TGACTTCTTG
CAAGGGTACG TTACGGCTGC TCGCTATCGC GAAATCGCCG GTCCTGATCT CCTCCGTGCT
GCCGAAGAGG CCTGCGCCTG TACCCTGGAT GCAATGTTTC ACAACTGGGT CATCACGGCG
GAGCCGGTAA CGATACCGTG A
 
Protein sequence
MIKSMIRFIL LIVITLTGSG SWIQPAIFRD EPAIATAVVD PFITAQTIAM RPDYVTDLTY 
QDWDRYTIQI QLDPAALQLS GNLTVRLTNR TTVDFDTIWF HLYPNHPDFG GRLDVTSARI
DDIPVSSRTL HNDTLIGLQT PHPIAPGQST TVTMTFTART PRNASRKSFG AYNLEAGVWS
IASFYPMLAR YIEGIGWDTR PIVSRGDFTV SAIALYDVTV EAPVDWQLVT SGSQLEQHIL
ADGRQRVRFV SGPQREFYLA ALQGLVATSA DVDGTRVISY VQANDPDAGA RSLAIATTAL
RIFNQRFGAY PYAEFEIIQA ALTQFYGMEY PGVVLIEQRL YQRNDHLLET TIAHEIGHQW
WYGLVGNDAQ GEAWLDEGLA SYSQILYYEM IDNLTQAQAE LDAFRAAYRR LRERGGDAPL
ATPPSELGNG RYVPVVYAKG ALFFHALRQQ IGEAAFNDFL QGYVTAARYR EIAGPDLLRA
AEEACACTLD AMFHNWVITA EPVTIP
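
The relationship between the two sequences above can be illustrated with a minimal translation sketch. It uses the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code; table 11 differs only in which codons may act as initiators, and this gene starts with ATG. The function and variable names are hypothetical, and only the first and last blocks of the gene sequence are included for brevity.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Translation table 11 assigns
# the same amino acids as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: not part of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final line of the gene sequence above.
first_blocks = "ATGATTAAGA GTATGATACG CTTCATACTG"
last_line = "GAGCCGGTAA CGATACCGTG A"

print(translate(first_blocks))  # MIKSMIRFIL
print(translate(last_line))     # EPVTIP
```

The final line is in frame because the 25 preceding lines hold 60 bases each, a multiple of three. Passing the full gene sequence to translate should reproduce the listed protein sequence if the record is internally consistent.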