Gene Cagg_3038 details

Gene Information

Locus tag: Cagg_3038
Symbol:
ID: 7266569
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 3693419
End bp: 3694639
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 57%
IMG OID: 643567859
Product: aminotransferase class I and II
Protein accession: YP_002464333
Protein GI: 219849900
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0436] Aspartate/tyrosine/aromatic aminotransferase
TIGRFAM ID:
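
The length fields in this record are consistent with one another, as the following sketch checks. It assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3693419
end_bp = 3694639

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # 1221 bp
print(protein_length, "aa")   # 406 aa
```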


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000279526
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCTGCCA AGATGGAATA TCGCTTTGCT CGCCGTCTCG CTGCACTTGA AGCCTCGGCT 
ACCGCCGCTA TGACGGCTCG CGTCGCTCAA ATGCGTGCCG CCGGGATCAA CGTTATCTCA
TTTAGTGTAG GGGAACCTGA TTTCGATACC CCCGAACCGA TCAAGCAAGC GGCCATTGCC
GGTATCCACG CCAACCACAC CCATTACACT CCTACCGGCG GTACGCTCGA ACTGCGCAAG
GCTGTCGCCG CCCGCGTCAC TGCCGACCAA GGCATTACTT ACGGTCTCGG TCAGGTAACA
GTAACGACCG GCGCAAAAGA GGCTCTCTAT CTCGCCTTTC AGGCGCTGTG CGACGAGGGT
GATGAAGCGA TTATCCCTGC CCCATACTGG GTAAGTTATG TTGAGCAGGC TAAGTTAGCC
GGTGCTACCC CGATTACACC ACAAACAACC GAACAGACCG GTTTTAAGCT GACACCTGAT
CAACTACGGG CGAACCTGAG TGAACGCACT CGAGTCGTTG TGCTCAACTC GCCATCAAAC
CCAACCGGTG CGGTCTACAG CGCCGAGGAA CTGGCAGCAC TGGCCGAGGT GCTACGTGAT
CATCCGGCCA TTATTATCAC CGACGAGATT TACGATGCAA TCTCGTATGT TCCCTACGCA
CGCCTGTTGC GCATTGCCCC CGATTTAGCC GAACGCACCT TAGTCGTCAA CGGTGCTGCT
AAAGCGTATG CAATGACCGG CTGGCGCGTC GGTTATGTCG CCGGTCCGCA ACCGATCATC
GAGGCAATTA AGGCTATTCA GAGCCATACC AGCACCCACA CATCGAGCAT CTCGCAAGAT
GCAGCACTGG CGGCCTACAC GCCCAACCCG GCGGTCGAAG CTGCGGTTAC GGCAATGACC
GCCGAATTCC ACCGCCGCCG CGATCTTATT CTTGAACTGT TGGCAACAAT CCCCGGCATA
ACCTGCACTG TGCCAGACGG TGCGTTTTAC GTATTCCCCA ACGTCAGTGC CCTGCTCCAT
CGCCCATTGC GCAACGGTAA GATTTGCACG ACCAGCGAGG AGCTAAACCT ATACTTGCTC
GAAGAGGCGC ATATCGCCTG TGTCGCCGGT GAAGCCTTCG GCGCGCCCGG CTACCTGCGC
CTGTCATACG CTACCGGTAG TGAAGATATC CGGATCGGTA TGCAGCGCTT CCGTGAAGCA
GTGCTTGTCG ACCACGCCTA A
 
Protein sequence
MSAKMEYRFA RRLAALEASA TAAMTARVAQ MRAAGINVIS FSVGEPDFDT PEPIKQAAIA 
GIHANHTHYT PTGGTLELRK AVAARVTADQ GITYGLGQVT VTTGAKEALY LAFQALCDEG
DEAIIPAPYW VSYVEQAKLA GATPITPQTT EQTGFKLTPD QLRANLSERT RVVVLNSPSN
PTGAVYSAEE LAALAEVLRD HPAIIITDEI YDAISYVPYA RLLRIAPDLA ERTLVVNGAA
KAYAMTGWRV GYVAGPQPII EAIKAIQSHT STHTSSISQD AALAAYTPNP AVEAAVTAMT
AEFHRRRDLI LELLATIPGI TCTVPDGAFY VFPNVSALLH RPLRNGKICT TSEELNLYLL
EEAHIACVAG EAFGAPGYLR LSYATGSEDI RIGMQRFREA VLVDHA
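
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the database's own method. It assumes that translation table 11 assigns amino acids to codons exactly as the standard code does (the two differ in permitted start codons), and that the sequence is given in reading frame on the coding strand. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(seq.split()).upper()

def translate(dna):
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)

# First 30 bases and final 21 bases of the gene sequence in this record.
first_bases = "ATGTCTGCCA AGATGGAATA TCGCTTTGCT"
last_bases = "GTGCTTGTCG ACCACGCCTA A"

print(translate(first_bases))  # MSAKMEYRFA
print(translate(last_bases))   # VLVDHA
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows the result to be compared against the protein sequence and the GC content field of the record.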