Gene Aasi_1662 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_1662 
Symbol 
ID6376325 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp872329 
End bp873981 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content41% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003573094 
Protein GI294661218 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAATTG TTAAAGAGAA TGCTCCTATA GGTTTTAGTA GAACGCAGTA TTTGGATTTA 
TATCTAGCAC CTGGTTTTAC AACCAACCAG TTAAGTAAAC ATAGCCCTGA ATGGCAAGAG
GCACATATAG GAGTAGTATT TCCTGAAAAA AGTTCAACTG GCAAAGGGTA TGTATATATA
GGTGACGGGG GGCTGCTCGG TGGTGGGAAT AGTGGTTCCA AAGGAGGCGG TGGAAATGAT
AGCGATAGGA CATCTAGCAG AAGTGAATCT AGTGAGAGGA GTACATCTGG CTCTAGCCAC
AGTTCAAGTA GTAGCAGTGG TAGAGCAAGT CATAATATTG GAAGCTCACA TAGTAGTTTC
AGTATGCCAA AAACTTCATC TTCCAGTTCC CATCATCGAG ATAGCTTTAA GGATTTTTGT
AATAAGAGTT CAGCTAATGT TTCAGCTTCG TTATCTTCTG CAGGTATAAA GCATGATACC
CATAGCACTT CCCATACTAG TGATAGTTTT AAGGATTTTT GTGATAAGAG TTCAGCTAAT
GTCTCAGCTA TGTTATCTTC TGCAGGTATA AAGCATGATA CCCATAGCAC TTCCCATTCT
AGCTTCTCCA GCCATACTAG TTCATATTCT AGCAGCAGGG CATCTAACCA AGCTGCTAGT
AGCAGTAATA GTGAAAGTAA GCCTGCTGCT TCTAAGGAGT CATCCTCTTC AAGCAAAGAA
TCTAAAGCCC CCCATCTTAC CCTAGAGAAT GCAGCTACGG AGGTGAGAAA GCACATACAA
GAGGCTAAAA ATGATGGTCC TAAAGCTACG AGTGTATCAG CCAGCCAGCA ACAACAAGCA
AAAGGGGAGC AACTACTTAA ACAGCTACGT GAAATCAAGC AACAACAGGA ACGGTCGTAT
AGCCAAACAC AGACCGCTTA CATTACACCA GGTAATTCTG AAATAGGGAA TAAGCTGTTG
TTAGACCAGC TAACCAAAAA AGGTCAAGAA TTGTCTACTT TAAAAGAATT AGAGTCAGAG
CTAACACAAA GCTTAGAGCG CGAGCGTCCA ACTAGCCATT ATGAGGAAAC AAGAAAAGAA
TCAATACTAG CTAGTGCTAC AACTACACCA AAAGAGATTA AGGTAAAAGA TAAAGGAAAA
GAAAAGGATA CAAGCTTGAC ACCTGGTACT ACCCAAGCAA CAAATAACCT TCATAGTACA
ACCCTATCTT CGTCTGGGGA CATTTCTACT GCTACAAGTA CTACTGAACC ACTACTCAGC
ACACCCACTA AAGGTCCAGC AGTTAAAGTT GAGAGTGGTA CACGTAAGAC AAAAATACCC
TACAATGATA GAGCGGACAG TTTTAGCTAT GACCACTACT CTTCAAGCAC TAGCACAAAT
ACACCTGCTG TATCAGCTAC AGAGCCTACT AACCCGGCTA CATCTAGTAC TACTTCCACT
AGTACTACTG ATAAAAGTGA AGCAGAAATT CATAACCAGT GGAGTGATTT GTTAAAGGAA
ATACAAGTCC AATTGGGACA TTCCAAAATA CAAGGTTTAG CAGACGCTCT ATATCTACAA
AAACAGGCTA GATTCTATAT AGAAAAATTA AAGTCGTATG AAAAATATAA ATGCATTAGC
AGATCTTCCC TCGAGCAAGC ACTCACTCAA TGA
 
Protein sequence
MAIVKENAPI GFSRTQYLDL YLAPGFTTNQ LSKHSPEWQE AHIGVVFPEK SSTGKGYVYI 
GDGGLLGGGN SGSKGGGGND SDRTSSRSES SERSTSGSSH SSSSSSGRAS HNIGSSHSSF
SMPKTSSSSS HHRDSFKDFC NKSSANVSAS LSSAGIKHDT HSTSHTSDSF KDFCDKSSAN
VSAMLSSAGI KHDTHSTSHS SFSSHTSSYS SSRASNQAAS SSNSESKPAA SKESSSSSKE
SKAPHLTLEN AATEVRKHIQ EAKNDGPKAT SVSASQQQQA KGEQLLKQLR EIKQQQERSY
SQTQTAYITP GNSEIGNKLL LDQLTKKGQE LSTLKELESE LTQSLERERP TSHYEETRKE
SILASATTTP KEIKVKDKGK EKDTSLTPGT TQATNNLHST TLSSSGDIST ATSTTEPLLS
TPTKGPAVKV ESGTRKTKIP YNDRADSFSY DHYSSSTSTN TPAVSATEPT NPATSSTTST
STTDKSEAEI HNQWSDLLKE IQVQLGHSKI QGLADALYLQ KQARFYIEKL KSYEKYKCIS
RSSLEQALTQ