Gene Aasi_0352 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_0352 
Symbol 
ID6377297 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp413074 
End bp414201 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content37% 
IMG OID642681522 
Producthypothetical protein 
Protein accessionYP_001957506 
Protein GI189501789 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases 
TIGRFAM ID[TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.332966 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGGTA TTTATATACA TATTCCCTTT TGCAAACAAG CATGTCATTA CTGTGACTTC 
CACTTTAGCA CTAACATGCG ACAAAAAAGT GAGATGCTTG CAGCTATTCA GCAAGAGTTA
ATCCTGCAAA AAGATTATTT AAACCAAGTT GAAATAAATA GCATATACTT AGGTGGCGGC
ACACCATCTT TACTAGAAGT CGAAGAGATA GCGGCCTTAT TAATAGATAT TCGGCAGATC
TTTAGATGTA AAGAAGGATT AGAAATTACG TTAGAAGCCA ATCCAGATGA TATAGACTTA
AAAAAGCTAC TGGCCTTACT TAAAATTGGC ATTAATAGGC TGAGCATTGG CATACAAACT
TTTCACAATA ACCTACTCCA GTATCTTAAC CGGGCACATG ATACTACAAA AGCCACACAG
AGCCTAGAAA TTGCTTTGCA AGCTGGCTTT AATAACTTTA ATCTGGATTT AATCTATGCC
ATACCGGGCC AAACCAAAGA AATGCTAAAA AGGGATTTAT TGACTGCCTT ACAATTTAAA
CCTACACATA TATCCGCTTA TTGCCTTACC ATTGAACAAA AAACTGTTTT CGGCCGCTGG
CTAGAAACTG GGAAAATCGA GGTAGTACAA GATGAGATAG CAGCAAAACA TTTCCATTTG
TTAGTTGATA CGTTAACAGA TAATGGGTAC AACCATTACG AAGTATCTAA TTTTGGCTTA
CCAGAGTACC ATGCGCAACA TAATACTAAC TATTGGAAAA GGGGAAGTTA TTTAGGCGTA
GGGCCTGGGG CTCATTCTTA TAACGGAACC AGTAGACAGT ATAACATAGC TAACAATACG
CGCTATATTC AAAGTATACA GGCAGGTATT ATACCCAGCA CTATAGAAAT CCTTCAGCCA
CAAGATCATA TTAATGAGTA TATTATGACA AGCCTACGTA CTCAATGGGG ATGTGATATG
GCATTGCTTA GAAACAACTA TCAATATGAT TTGCAAGCAA CCCACCCTTC TTATTTAGAA
CTACTTATTA ACAGGCAACT TGCCTATATA CAAGAAAATG TATTGATATT GACTGTAGAT
GGGAAACTAA TAGCAGATAA AATAGCTGCA GATTTATTTA TTGCTTAG
 
Protein sequence
MAGIYIHIPF CKQACHYCDF HFSTNMRQKS EMLAAIQQEL ILQKDYLNQV EINSIYLGGG 
TPSLLEVEEI AALLIDIRQI FRCKEGLEIT LEANPDDIDL KKLLALLKIG INRLSIGIQT
FHNNLLQYLN RAHDTTKATQ SLEIALQAGF NNFNLDLIYA IPGQTKEMLK RDLLTALQFK
PTHISAYCLT IEQKTVFGRW LETGKIEVVQ DEIAAKHFHL LVDTLTDNGY NHYEVSNFGL
PEYHAQHNTN YWKRGSYLGV GPGAHSYNGT SRQYNIANNT RYIQSIQAGI IPSTIEILQP
QDHINEYIMT SLRTQWGCDM ALLRNNYQYD LQATHPSYLE LLINRQLAYI QENVLILTVD
GKLIADKIAA DLFIA