Gene Aasi_0789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_0789 
Symbol 
ID6376764 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1004896 
End bp1006335 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content35% 
IMG OID642681933 
Producthypothetical protein 
Protein accessionYP_001957896 
Protein GI189502179 
COG category[K] Transcription 
COG ID[COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog 
TIGRFAM ID[TIGR02395] RNA polymerase sigma-54 factor 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.545015 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAAAAT TACGTTTAAC CCAAAAACTT ACTCAAAAGT TAACCCCTCA ACAAATTCAA 
TTAGTAAAGC TTTTACAAAT TCCTTCCATA GATATTAATG CACGTATACA GCAAGAATTA
GCAGAAAACC CTGTTTTAGA ACCAGCTAAT GATAATGTTG AGTATCTGCA AGAAGCTGAA
GATAATTCTG CTATAGATAC GATGACTGTT GGTTATATAG ATGAGTTACC ACAAAAAAAC
TACCTTACAT ATCGCGATTA TGAGGCAGAA GAACGATGGC TCAGAAAGGA AGCTTCAGTT
CCCAATAGAT ATTCTTTGCA AGAACAGCTA TTAGAACAAT TAGAAATGCT TCGTTTGGAT
GAAGGTCAAC ATAAGATTGG TGAATACCTT ATTGGAAGTT TAGAAACAGA TGGTTATATA
CGGCGTGATT TGGCAGCGCT TGCTAATGAT CTTTTATTAA CACAATATAT TCAGACCAAT
GAAACAGAAG TAGAAACAAT ACTTAAAAAA ATACAGACAT TTGATCCTCC TGGAATCGGC
GCTAGAAGCT TACAAGAATG CTTGCTTATT CAGCTAAATA GGCAACCATC TAAGCCTGTA
CGTAATATGG CTATTCAAAT CTTAAATCAG ACTTTTGAAT TGTTTACCAA AAAACATTAT
TCCCAAATTA TTAAAAAACT TGCCATAGAA GATCAGGAAC TTTTTAAGGA AGCGTTAGAA
CTTATTGCAA AGCTAAATCC TAAACCAGGT GGAACATTAG GTTCAGATAT TCAACAAAAC
CAGATTTTAT ATCCAGATTT TCTAGTCACT AAGCATAACC AACAATTGGA GGTACGGTTG
AATGCTTATA ATACACCTGA GCTTAGACTG AGCAGGAGCT ATATACACAT TTTAGAAACC
TCTCAGAAAC TAGGTAAAGA CAAAAAAAAG AGTCCCAGCT TACAAGCAGC TTCTTCTTTT
GTTAAACAAA AAGCAGAGTC AGCAAAGTGG TTTATAGAAG CATTGAACCA GCGTAAAAAG
ACGCTTTTGC GTACCATGCA GGCTATAGTA CAGTTGCAAT ACGATTTTTT TATGGAAGAA
GATGAAAGTA AGCTTAAACC GATGATATTA AAAGATGTAG CAAAAATTAT TGAAATGGAT
ATTTCTACAG TTTCTAGAAT TGTTAATAAT AAATCTGTGC AAACACAGTT TGGCATTTAC
CCTCTTAAAT TCTTCTTTAC TGAAGCTATC AGTACTACTA TGGGAGAAGA TGTAAGTAGT
AAAGCTGTAA AACAAGCATT GCTAGAACTT ATTCAACAAG AGAGCAAACA ACAACCTTAT
GCAGATGAGC AACTTACTGC TATGCTTGTT GATAAGGGTT ATCATATAGC ACGTCGTACT
GTAGCTAAAT ATAGGGAGCA GCTTAATATG CCTGTAGCTA GGTTAAGGAA AGAACTGTAG
 
Protein sequence
MQKLRLTQKL TQKLTPQQIQ LVKLLQIPSI DINARIQQEL AENPVLEPAN DNVEYLQEAE 
DNSAIDTMTV GYIDELPQKN YLTYRDYEAE ERWLRKEASV PNRYSLQEQL LEQLEMLRLD
EGQHKIGEYL IGSLETDGYI RRDLAALAND LLLTQYIQTN ETEVETILKK IQTFDPPGIG
ARSLQECLLI QLNRQPSKPV RNMAIQILNQ TFELFTKKHY SQIIKKLAIE DQELFKEALE
LIAKLNPKPG GTLGSDIQQN QILYPDFLVT KHNQQLEVRL NAYNTPELRL SRSYIHILET
SQKLGKDKKK SPSLQAASSF VKQKAESAKW FIEALNQRKK TLLRTMQAIV QLQYDFFMEE
DESKLKPMIL KDVAKIIEMD ISTVSRIVNN KSVQTQFGIY PLKFFFTEAI STTMGEDVSS
KAVKQALLEL IQQESKQQPY ADEQLTAMLV DKGYHIARRT VAKYREQLNM PVARLRKEL