Gene Aasi_0917 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_0917 
Symbol 
ID6377093 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1174241 
End bp1175443 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content34% 
IMG OID642682051 
Producthypothetical protein 
Protein accessionYP_001958012 
Protein GI189502295 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.151697 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAGATA AAAAATTTAT ACTTAGTCTA AAATCCTCTT TTTTGATAAC AATATTCTCA 
CTGTTTGTTT TTACAGCATG CAGTGATGGA TGTAAATCAG CTCCCAAATT ACCCAGAGGT
AAAGGTAGTA ATTCAACAGC AAAAGGTACA ACAGGAAAGA AAGGTACAGC TGGAAATACT
GAAAAGGAAA GTAGTAAGAA AACAAATACT AAACCCACAC CTACTCCTTC TAATCTTCAA
AAAACAATTG TTTTGTTCCA TGGGCTTGGA GTTCCTGAAG ATACAGGAAT TTCAGCATTA
GGTAAGAAAT TGAAAAATGA TATTAAAGAT GTTAGAGTAA TGATTTTAAA ACGGCCTAAT
TCTACCACTG TTTCTACTAC ACAACAGGCA GAAGAAGCTT ATGCTACGCT AAAAGAAGAA
TTGAAAAAAA AGGATATGAT TGGCAGCCCT ATTTGTCTAA TAGGTGATAG CCATGGTGGT
TTAGTCGCAT TAGAGTTATA TCGTCAATAC AAAGATGATC TGAATATAGT AGGTATTGTT
ACTAACCACT CTCCGTTAGA AGGAGCACCA GGTGTAAATG TAGATATTGA TGTTGTTAAT
GAGTTTAAAA AAACAGTTAG AGGTTTATTG ACATCCGCAT CGTTATTCCT AGGAGGGCTA
ACTTTAAATC CTAGCCAAAT AGACTCTTTA CCATTGAAAG AAATGCTTAC AGGAGGGATA
CAGCAACCTT TGATAGATGA TCTAACAGCA AATAGTTCCT TATTAAACAA CATTCGAGCA
ACTTTGACAT CTATACAAAT ACCTGTGTTG GTGTTGGCAG GTTATGCAGA TATATTAACA
AACATCGATG CATTACTTGG TTTTGCTTCT CAAGATAAAA GATTTGGGCA AGAGTTGAGT
AAAGTAAGAG AACAGCTAAA GGAATTGCAA AAGGAAAAGG ATAAGATGGA AGAACCCTTG
AAGCTACTGA TAAATGGTGC GCTTAGCTCT CTTGAAGATA CATTTGGAAG AATTATTGGT
GATAAAAGGA ATGACTCTTT TCTTCCATAC TACAGTCAAT ATGCTGAACA TATTTCAGTA
AGTCCAAGCG TTATGCGTAA TCCTAAACAA GGATATCACC ACTTTTATGG TATGCAGTAT
CATAATGAAG TCTATAATGA TATAGTAAGA TTTGTTAATC AAGCTTTTGA GAACAAAAAA
TAA
 
Protein sequence
MIDKKFILSL KSSFLITIFS LFVFTACSDG CKSAPKLPRG KGSNSTAKGT TGKKGTAGNT 
EKESSKKTNT KPTPTPSNLQ KTIVLFHGLG VPEDTGISAL GKKLKNDIKD VRVMILKRPN
STTVSTTQQA EEAYATLKEE LKKKDMIGSP ICLIGDSHGG LVALELYRQY KDDLNIVGIV
TNHSPLEGAP GVNVDIDVVN EFKKTVRGLL TSASLFLGGL TLNPSQIDSL PLKEMLTGGI
QQPLIDDLTA NSSLLNNIRA TLTSIQIPVL VLAGYADILT NIDALLGFAS QDKRFGQELS
KVREQLKELQ KEKDKMEEPL KLLINGALSS LEDTFGRIIG DKRNDSFLPY YSQYAEHISV
SPSVMRNPKQ GYHHFYGMQY HNEVYNDIVR FVNQAFENKK