Gene Aasi_1420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_1420 
Symbol 
ID6377522 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1822407 
End bp1823792 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content37% 
IMG OID642682491 
Producthypothetical protein 
Protein accessionYP_001958441 
Protein GI189502724 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0469] Pyruvate kinase 
TIGRFAM ID[TIGR01064] pyruvate kinase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.735378 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCTGCTA GTGGGAGCAA AGAAGTGCTT ACACAATTGA TTGAAGCTGG CGCAGGTATT 
ATCCGCTTAA ATTTCTCACA TGGCACCTAT ACAGCTCATC AACGAGTCAT AGAACATGTG
CGCACTATTA ACCAAGAGTT AGGCAAACAT GTTTGTTTAC TACAAGACTT ACAGGGGCCT
AAGATTAGAA TTGGTTTATT AAAAGAGGAA ACTATACATC TAGTTAGTGG ACAAGAATTA
GCACTTACAC CTGAAGTAAT ATTAGGTACT GAAAACCGAA TAACAACCAC ATACACTAAC
TTAGCACATG AAATAAAAAT TGGTGATACT ATTTTGGTAG ATGATGGTAA AATTGTATTA
AAAGCTATAC GTAAAGAAGG AAACGAGCTA ATTACAGAAG TAATTCATGG TGGCGATTTA
CGTTCCAATA AAGGACTTAA TTTGCCAGCT ACTAGGCTCT CTACGCCCTC CCTTACAGAA
AAAGACCGGG AAGATTTAGC TTTTGGGCTT CAACAAGATG TAGAATGGAT AGCTCTTTCT
TTTGTTAGAA ATCCGCAAGA TATTATAGAG CTCAAAGAAA TCATCAAACA ATCTGGCAAG
AACACTAAAG TAATTGCTAA AATAGAAAAG CCAGAAGCAT TAGAACATAT TCAGGAAATT
ATAGCAGCAG CAGATGCCCT TATGGTAGCC CGAGGAGATT TAGGTGTAGA AATTGCTATG
GAAAAAGTAC CTATGGTACA AAAGAATATA GTAAGCTTAT GTAACCGAGC TGGCAAACCT
GTTATCATAG CTACCCAGAT GATGGAGAGT ATGATAGAAA ATCCTCTGCC TACCCGGGCT
GAAACTAATG ACATAGCCAA TGCAGTTATA GATGGTGCTG ATGCTTTAAT GCTTTCTGGT
GAGACAGCAT TAGGAAAATA TCCTATTAAA GTAGTTGCCG AAATGAAAAA GACTATTTTA
GTTGTTGAAC AAACTGCACC TATCTACAAT AGATACCAAG ATATCTCATC CACATCTCCA
ACCTTCTATA ATGATAGTTT GGTCAGAACA GCCTGTAGGC TAAGTCATGA TATTCAGGCT
AAAGCTATTA TATGCTTAAC ACAAACAGGA TGGACAGCTT TAGAGCTAGC CAAGCATAGG
CCTCAGGCAA ATATATTTGT CTTTACAGAT AATCAGTCTC TACTTAACAG CATCAACTTG
ATTTGGAACG TAAGAGGATA TTATTATGAT AGTATGGTGT CTACAGACCA GACATTTGCT
GATATCGAAT CTCTTTTGAC AAAAAACAAT TACTTAAAGT CAGGCGATGT ATTTATTAGT
ATGGCTAGTA TGCCTATTCA TAGTAAACAA CGTACCAATA TGTTAAAGAT TAATAGAGTA
TCTTAA
 
Protein sequence
MPASGSKEVL TQLIEAGAGI IRLNFSHGTY TAHQRVIEHV RTINQELGKH VCLLQDLQGP 
KIRIGLLKEE TIHLVSGQEL ALTPEVILGT ENRITTTYTN LAHEIKIGDT ILVDDGKIVL
KAIRKEGNEL ITEVIHGGDL RSNKGLNLPA TRLSTPSLTE KDREDLAFGL QQDVEWIALS
FVRNPQDIIE LKEIIKQSGK NTKVIAKIEK PEALEHIQEI IAAADALMVA RGDLGVEIAM
EKVPMVQKNI VSLCNRAGKP VIIATQMMES MIENPLPTRA ETNDIANAVI DGADALMLSG
ETALGKYPIK VVAEMKKTIL VVEQTAPIYN RYQDISSTSP TFYNDSLVRT ACRLSHDIQA
KAIICLTQTG WTALELAKHR PQANIFVFTD NQSLLNSINL IWNVRGYYYD SMVSTDQTFA
DIESLLTKNN YLKSGDVFIS MASMPIHSKQ RTNMLKINRV S