Gene Aasi_1069 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_1069 
Symbol 
ID6376876 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1381115 
End bp1382110 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content39% 
IMG OID642682182 
Producthypothetical protein 
Protein accessionYP_001958143 
Protein GI189502426 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase 
TIGRFAM ID[TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.857082 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCAAA TTAGAGTAGC CATTAACGGC TTTGGTAGAA TAGGCAGATT AAGCTTAAGA 
GCTTTATTAC AAAAATCAAA CATAGAAGTA ATAGCCATCA ATGATCTTAC GGATGCCTCT
ACCCTAGCAC ACTTATTTAA ATATGATTCT AATTATGGAA GGTTTGCAGG CCATGTAACA
GCGGATGCAA ACCATCTTCT TGTAAATAGC AAGAGAATTA CTGTATTAGC AGAACCTGAC
CCTACCAAGC TACCATGGGA AAAGCTACAA ATAGATGTTG TATTAGAAGC TACTGGAAGA
TTCTTAGATA AGGCTAGCAA TGAGCAGCAT ATAACTTCGG GTGCTAAACG TGTAGTTATT
TCAGCACCGG CAAGCAATGA TATCCCTACT ATAGTGTTAG GAGTTAATGA GAATATTTTA
TCTACGGCTG GTCCTATCAT TTCAAATGCT TCTTGTACAA CAAATTGTTT AGCACCTGTG
GCATACGTAC TAGATAAGTA TTTTGGTATT GAAAAAGGTT ATATTAATAC CATTCACGCT
TATACGGCCG ACCAACGTTT ACAAGATGCG CCCCATAAAG ATTTACGGCG TGCTAGGGCA
GCTGCAAAAT CTATTATCCC TACTACTACA GGAGCTGCAA AATCTATAGG TACTGTTTTA
CCACAGCTTC AAGGGAAATT AGATGGTATT GCTATGCGTG TACCTGTCGC AGATGGCTCT
ATATTGGACT TAACAGCTAT TCTTCGGCAG CCAGTTACCA AGAAAATGAT TAATACTGCC
ATGAAGCAAG CTGCTGATGG AGCTATGCAA GGCATACTTG AATATACAGA AGATCCTATT
GTTTCTGTAG ATGTGATTGG CAATCCACAC TCCTGTATTT TTGATGCGCA GCTTACTTAT
ACACAAGGGA ACCTAGTAAA GGTAGTAGGT TGGTATGACA ATGAAGGCGG CTATGCACAT
CGTATAGCAG ATTTAATAAG TAAGCTCGGC AGATAA
 
Protein sequence
MEQIRVAING FGRIGRLSLR ALLQKSNIEV IAINDLTDAS TLAHLFKYDS NYGRFAGHVT 
ADANHLLVNS KRITVLAEPD PTKLPWEKLQ IDVVLEATGR FLDKASNEQH ITSGAKRVVI
SAPASNDIPT IVLGVNENIL STAGPIISNA SCTTNCLAPV AYVLDKYFGI EKGYINTIHA
YTADQRLQDA PHKDLRRARA AAKSIIPTTT GAAKSIGTVL PQLQGKLDGI AMRVPVADGS
ILDLTAILRQ PVTKKMINTA MKQAADGAMQ GILEYTEDPI VSVDVIGNPH SCIFDAQLTY
TQGNLVKVVG WYDNEGGYAH RIADLISKLG R