Gene Aasi_1040 details


Gene Information

Locus tag: Aasi_1040
Symbol:
ID: 6377047
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Amoebophilus asiaticus 5a2
Kingdom: Bacteria
Replicon accession: NC_010830
Strand:
Start bp: 1346503
End bp: 1348434
Gene Length: 1932 bp
Protein Length: 643 aa
Translation table: 11
GC content: 37%
IMG OID: 642682156
Product: hypothetical protein
Protein accession: YP_001958117
Protein GI: 189502400
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG0317] Guanosine polyphosphate pyrophosphohydrolases/synthetases
TIGRFAM ID:
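
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record itself. The helper names `span_length` and `expected_protein_length` are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1346503
END_BP = 1348434
GENE_LENGTH_BP = 1932
PROTEIN_LENGTH_AA = 643


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon, assuming a complete, unspliced CDS."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both checks agree with the listed values: 1348434 - 1346503 + 1 = 1932 bp, and 1932 / 3 - 1 = 643 aa.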


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGTAC AATCTATAGA GGGTAATAAT TGGAAGCAGC TCAGATATTA CCTAGATTTT 
CCTCGGCAAA TTTTAAACTA TTCACAAAAA AAGGTAGCTC AGTATGGTGC CAATTATACT
GCGTTTGCTT TTTTTATGAC TTTCCATTAC ATGATACCTT TTTTCATGCG AACACAAGTA
AGCGTAGAAT CGATGTATGC ATGGCTATTG ATTATTAAGT TAATCGGTGC TTCGTTATGT
GTGGGGCTTC TTTTAAAACA TTACTGGCCT AGTAACTTAG TAAGCTACTT TCCTCTTTAT
TGGCACTTTA CCCTATTATA TTGTGTGCCT TTTTCTTTCA CACTAGTATT TCTAATTAAT
GGTAGCGGAG TAGAGTGGCT TGTAAATATA TCTTTAGGCA TTACCTTGCT CATTGTTTTA
GTTGATTGGA TTACTTTCTT AATACTCTCT ATACTAGGCA CAACATTAGG ACTCTTGTTT
TATGTACAAT TTATAGGGGC CCTACCTTTA CTAGATATAT ATAGCCTCTA TACATTAAGT
TATGTGTTCT TAGCTTCTTT GAGCATTGGG CTTATATTTG CTCGCAGGAA AGAACAAAGT
TTTGACTTAC TGCTGCTACA AAATCAACAG CTTAGCGAGG CTCAACAAAA AAACAGAGAG
GAATTAATAG AAACGCTTAA ATATAGACAA CAACTATTGC AAGAACTTAA ACCTGAAGAA
GTAGCTATCT TTGACGAAGT AACCACATCT TACATGAAGC AAGCCATCTA TCGCATGACT
GATTATATGC GGCTAGATGT AACCTCCGTA ACCTTAGACG ACCTGCAACA AGCATTGGTA
GATCTTTATA AATATAAATT ACAAACTCCT GATCAGCCTG AGATTTTATT TAAGAAAGAC
ACCAACCAGA CTGTTTTACA AGCAGATTTT GTTAAGCTAA AGCAACTATT ACTAAATTCG
ATTAGCTATG TACAACAATA TAATACTTCC CATAGTCCCA TTATTGTAGC TATAGAAGAT
GCTTGGCTAG GCCATGAAAT AGCTCATATG CAAAACTATA CGCGTAAGCT AGAGGCTCTA
AGGATAACCA TTACAATAGA ACCAAGCGTA CCTCCTACCC AACCCCTTTA TAAAATAGAT
CCTGCTAAAT CTAGTACATG GGTACCTCAG CATGAAGATG AATTTTTACT GATAGAGAAT
GCACGCATCA TTGATGCGCA CTACGGCTAT ATAGCTACTA AGTCAACACA TACTCAAATG
TATGTATTAC CTGTCAAGCT AAGAGAAATT CGTGGTAAGG TTATGGAGCT TATCAAGGAG
CCGGCAGCAG CTGTCCCCGA AGAGGTAAAT CACCCTTTGG CCATACAGTT AGAGCAAGAA
CTAATGGAAA GGCTTCGAGA TACAGAAGTA GATATCTGTG TTATTCAAAA AGGACTAGAT
ATTATTAAAA GATATCATGG TGGCGTAAAA AGGAAATCAG GAGAGCCTTT TTTTACACAT
CCTATGACTG TAGCTTTGAT TTTACTGGAA TATTCACAAG ACCAAGATGC CATTTTAGGT
GCTTTGCTAC ACGATACAGT AGAAGATACA AGTTTATCGC TGACACACAT CCGGATGATA
TTTGGAAAAA CAGTAGAATT TTTAGTAGCC AAAGCAACGA ATTTAGAAGA CCATAAACGG
CGTGTAAGTT TAACTGATCA AGAAAACTTG GCAAGAATTT TAAACTATGA AGACCCTAGA
GCAGCTTTGA TTAAATTGTC GGATAGATTG CATAATATGC GTACCATTCA ATTTCACTCT
TCAGTAGCCA AGCGTAAACA TATTTCTCAA GAGACATTAG ACCACTTTGT ACCCTTGGCC
AGAAAGTTAG GTTTAGAAAA GATGGCAGCC GAGTTAGAAC AGTTGAGCAA TGAAGTAGTA
AACAAGAAAT AA
 
Protein sequence
MSVQSIEGNN WKQLRYYLDF PRQILNYSQK KVAQYGANYT AFAFFMTFHY MIPFFMRTQV 
SVESMYAWLL IIKLIGASLC VGLLLKHYWP SNLVSYFPLY WHFTLLYCVP FSFTLVFLIN
GSGVEWLVNI SLGITLLIVL VDWITFLILS ILGTTLGLLF YVQFIGALPL LDIYSLYTLS
YVFLASLSIG LIFARRKEQS FDLLLLQNQQ LSEAQQKNRE ELIETLKYRQ QLLQELKPEE
VAIFDEVTTS YMKQAIYRMT DYMRLDVTSV TLDDLQQALV DLYKYKLQTP DQPEILFKKD
TNQTVLQADF VKLKQLLLNS ISYVQQYNTS HSPIIVAIED AWLGHEIAHM QNYTRKLEAL
RITITIEPSV PPTQPLYKID PAKSSTWVPQ HEDEFLLIEN ARIIDAHYGY IATKSTHTQM
YVLPVKLREI RGKVMELIKE PAAAVPEEVN HPLAIQLEQE LMERLRDTEV DICVIQKGLD
IIKRYHGGVK RKSGEPFFTH PMTVALILLE YSQDQDAILG ALLHDTVEDT SLSLTHIRMI
FGKTVEFLVA KATNLEDHKR RVSLTDQENL ARILNYEDPR AALIKLSDRL HNMRTIQFHS
SVAKRKHISQ ETLDHFVPLA RKLGLEKMAA ELEQLSNEVV NKK
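
The sequences above are printed in blocks of ten characters, so they need the whitespace removed before any analysis. The following is a minimal sketch, not the database's own method, of how the gene sequence could be cleaned, its GC fraction computed, and its codons translated. The function names are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares for all non-initiator codons; the sketch does not handle alternative start codons, which is adequate here because the gene begins with ATG.

```python
from itertools import product

# Codon table in NCBI ordering (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq_text: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    seq = clean(seq_text)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq_text: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq_text)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence in this record.
    first_blocks = "ATGTCTGTAC AATCTATAGA GGGTAATAAT"
    print(translate(first_blocks))  # MSVQSIEGNN
```

Pasting the full gene sequence into `translate` should reproduce the protein sequence listed above, and `gc_fraction` on the same input can be compared against the GC content field.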