Gene Aasi_1744 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_1744 
Symbol 
ID8999475 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1128105 
End bp1130030 
Gene Length1926 bp 
Protein Length641 aa 
Translation table11 
GC content36% 
IMG OID 
ProductNa-solute symporter 
Protein accessionYP_003573149 
Protein GI294661273 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.488974 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCAGCAA GGCAAGCCCG CCAAGAAGCT TGGAAAAGGT TTATTGGTAA CTTTAAACAG 
GCTAATATAT ACGATTACCT ACAAAAAAAC TTACCTGCTT ATGAAGTAGT ATATTCTATG
TTCGCCATTT ATGTAATTGG GGCTACCTAT GCTTCTTTTT ATACTGTATC TGAAAAGATT
ATTAGTAATC ACCGGGATCT GTATGACTTT GCTGCACATT CTGTCCTGTT TGTGACAGCT
GGATTCTTAA CCTACCCGGC TTGGCCCCCC ACTTTTAAAG GTAAACGTTT TATAACCTTT
GCTTGGCCAG CAGGTATATT TTATGTTCTA TTCGTCATAG GTATCATTCT AGTACTCATG
AGTGGTTTTC ATGAAGTACA GGTAATGATT TTTTTGCTTA ACCTAGTTAT GGCTGCATTC
TTATTATCAT GGCCTTTAAT GCTCTTTTTA TCTATTACAG GTGTCTTAAT AGGAATTTTG
GCAATGTATA TGCAGTGTGG CCATTTGTAC TGTGGTAATA TAGATGGTAC AGCAGAATTT
AAGGTTGTTT ATGGTATTCT TTTAATAAGT AGCTTTCTAA TTGCGATTTT CAGGTTTAAA
GAGAATAAAA AGAAATTAGA AAGTAAAAAC ATTTATCTAG CTAGGTTGTA TGAGGAAAAG
AGTAACGAGC TAGCAGAGAT TTTAGGCTAT AGAGAACAAA TAATAAAAGA ACTAAGTGAG
GATGAGAAAA GATTGTTTGA TGATACTACG GCTGCTTATA TAGAGCAGAT CATCTACCGA
ATGACGGATT ATATGCGTCT AGAGGTGACT ACAATTAATT TAGATCAGCT TTTGTTAGAA
GTTAAAGATA TTCTGAAGCT CAAAGAGCTT GATAACATGA CTCAATTGAT AACTAAAAAA
CTGACAAAAG AAGAATCCAT TCATGGCGAT ACAGCAAAAC TCAAGCAGTT ATTAGTGAAT
GCTATCCTAT ATGTCCAAAA GCACAGCCTT TCGAACCAAC CTATCACGTT GATAATAGAA
GATGCCAAGC TAGGTCATCG AGTAGATTAT ATTAAAGATT ATACCAGGCA ATTAGCAGCT
TTAAAATTTA CCATTACCAT AGAAAAGGAC ATACCGACTA AAAAGGATCT CTATATGATC
GACCAACTGC CTTTGTTAAG TCAACACAGT AGAAAAGGTA AATTAATAGA AAATGCTCGT
ATCATTCATG CGCACTATGG CTATGCCAAC CTAGACAGCG AGCATACACA GGTATATGTA
CTCCCTATTA ACGTGCGAGA AGTAAGAGGC AAAGTGATGG AATTATTAAG GGAGCCAGTA
GAGGCAGATA GAGAAGAAGT AAGACATCCA TTGGCTATCG AGCTTGAAAA AGAGTTAATG
GATAAAATAA AAGGGAAAAA GATAGATGGT AAGGTTATTA ATAAGGCGCT GGATACTATT
AAAAGATATC ATGCAGGCGT TAAGCGTAAA TCAGGCGAAC CTTTCTTTAC GCATCCTATT
AATGTGGCTT TGATTCTATT AGAATACTGC CAAGATCAGG ACGCAGTTAT AGCAGCATTA
TTACATGATA CGGTAGAGGA TACGAGTCTT TCACTTGTTC AAATTAAATC TATGTTTGGC
GAGGATGTGG CTTTTATAGT AAACAAAGTA ACTAACCTAG AAGATAACTT GCGTAGAGTC
AGTTTAGAAG ACCATGAGAA TGTCTATCGT TTAATGAACT ATGAAGATGA GCGGGCAGCT
TTTGTCAAAT TAGCAGACAG GCTGCATAAC ATGCGCACTA TCAGTGGTCA TTCTTCACTT
GCCAAGCAAA AACATATAGC AACTGAAACG TTAAATTTCT TTGTTCCGCT AGCTAAAAAC
TTAGGTCTTA CAACTGTATC ACAGGAGCTA GAAAAGCTTA GTTTAGAGGT ACTCGGTAAG
AAATAA
 
Protein sequence
MAARQARQEA WKRFIGNFKQ ANIYDYLQKN LPAYEVVYSM FAIYVIGATY ASFYTVSEKI 
ISNHRDLYDF AAHSVLFVTA GFLTYPAWPP TFKGKRFITF AWPAGIFYVL FVIGIILVLM
SGFHEVQVMI FLLNLVMAAF LLSWPLMLFL SITGVLIGIL AMYMQCGHLY CGNIDGTAEF
KVVYGILLIS SFLIAIFRFK ENKKKLESKN IYLARLYEEK SNELAEILGY REQIIKELSE
DEKRLFDDTT AAYIEQIIYR MTDYMRLEVT TINLDQLLLE VKDILKLKEL DNMTQLITKK
LTKEESIHGD TAKLKQLLVN AILYVQKHSL SNQPITLIIE DAKLGHRVDY IKDYTRQLAA
LKFTITIEKD IPTKKDLYMI DQLPLLSQHS RKGKLIENAR IIHAHYGYAN LDSEHTQVYV
LPINVREVRG KVMELLREPV EADREEVRHP LAIELEKELM DKIKGKKIDG KVINKALDTI
KRYHAGVKRK SGEPFFTHPI NVALILLEYC QDQDAVIAAL LHDTVEDTSL SLVQIKSMFG
EDVAFIVNKV TNLEDNLRRV SLEDHENVYR LMNYEDERAA FVKLADRLHN MRTISGHSSL
AKQKHIATET LNFFVPLAKN LGLTTVSQEL EKLSLEVLGK K