Gene Aasi_0694 details

Gene Information

Locus tag: Aasi_0694
Symbol:
ID: 6376847
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Amoebophilus asiaticus 5a2
Kingdom: Bacteria
Replicon accession: NC_010830
Strand:
Start bp: 883230
End bp: 886106
Gene Length: 2877 bp
Protein Length: 958 aa
Translation table: 11
GC content: 36%
IMG OID: 642681846
Product: hypothetical protein
Protein accession: YP_001957813
Protein GI: 189502096
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG0591] Na+/proline symporter
TIGRFAM ID:
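
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration. It assumes the Start bp and End bp values are 1-based inclusive positions and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 883230
end_bp = 886106
gene_length_bp = 2877
protein_length_aa = 958

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed to be the stop.
codons, remainder = divmod(gene_length_bp, 3)
residues = codons - 1

print(span_bp, codons, remainder, residues)  # 2877 959 0 958
```

Under these assumptions the three fields agree: the span equals the gene length, the length is a whole number of codons, and 959 codons minus one stop codon gives 958 residues.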


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTAGGAC TCAACATAGA CATGATAATT TTCGGCTTAT TTTTGGTTAT TAACCTATCA 
ATAGGCCTCT TTTCTAGCCG TCGAGTAACT TCTCTACGTG ATTATGCAGT AGGCAGAAAA
GACTTTTCTA CAGCCACATT AACAGCTACT ATTGTAGTCA GCTGGATTGC AGGTATGTAT
GTGTTTGAAA TATTAGAACA TACATACAGA GACGGTCTTT ATTTCATTAT AGTAGTAAGT
GGTGCTTCTG TCTGCTTGTG GTTAGTGGGA TTATTAGCAG TGCGAATGCG AGAATTTTTA
AAAAACCTTT CTGTAGCAGA AGCTATGGGA GATTTGTATG GAAAATCTGT ACAAATTATT
ACAGCTATTA GTGGTATATT AAGTGTGATA ACTTTGGTAG CTATGGAATT TAAAGTAATT
AGCAAAGTAA TCTGCTTAAT TTTTGCTACA GAAAGTGTAT GGGTGCCCGT AATAGCGTCT
GTAGTGGTCA TTATATACTC AGTCTCAGGT GGTATACGGG CCATTACGTT TACTGATGTA
ATACAATTTT TTACGTTTGG CACTTTTATT CCAATCCTCG CATTAGTTAT TTGGAATCAA
CTCAAAGACC CACATCAAGT TATAGTCTTA CTTAATACCA ACCCTAATTT TAGCTGGTCA
CAAGTAATTG GTTGGCATCC TAAATTCATC GATACGCTAT TCATGTTATT ATGGTTTACA
ATTCCTGCTA TGGATCCTGT CATATTTCAA CGTATCAGTA TGGCAAAAGA TGTAGCACAA
GTAAAAAAAT CCTTTAATTA TGCAGCTCTC ATCAGCTTAG TAGTTTGTTT GTTTTTAGCT
TGGGTAGCTA TTTTGTTATT GGCTAGCAAT GCAAATTTAG AACCTGACAA GCTTTTTAAT
TACATTATCA ATAATTATAC AACAGGCGGG TTACGGGGTC TAATTGGTGT AGGTGTGTTA
GCCCTTGCTA TGTCCACGGC GGATTCTTAT CTCAATGCTT CTGCTGTTTT GTTAACCAAT
GATATTGCGA AGCCTTTAGG ACTTAACTTT AGCAAGGAAA AGGAGGTAGT TGTTGCGAGA
TCCCTTTGTT TACTCTCAGG GCTTTTTGCT TTGCTGATTG CCTTACAGTT TAAAGGTATA
TTAGCTTTAC TGCAATTTGC CAATAGCCTG TATATGCCAG TTGTTACAGT ACCTTTATTA
CTAGCTATTT TAGGTTTTAG GAGTACTACA TTGTCCGTAC TTATAAGCAT GGGTATGGGG
TTCATTACCG TAGTAGGCTG GCCTTTAATT TTTAAGGGAA GCGATAGTAT TATTCCTGGT
ATTGTCGCTA ATCTAGTAGG TTTACTTGGT AGCCATTATA TCCTTAAACA ACCAGGCGGC
TGGGTAGGCG TACGGGAACC AGAACCTTTG TTAGAGGCAA GAGAAAATAG ACGAAAAGCT
TGGAAAAGGT TTAAAAAAGA TTTTAAAGAA TTTAGCTTAG TACAATACTT ACAAAAAACT
TTTCCTAATC AAGAATACCT ATTAACCATT TTTGGAATCT ATGTAATTGC TGCTACTTAT
GCTTCCTTTT ATACAGTGCC CGAAGAAATA CAAACCAACC ACGCTAGGCT TTACCACATC
ATTGGTCAAA GCGTACTCTT TATTAGTACA GGCTTATTAA CCTACCCCTT ATGGCCTCCT
ATTTTTAAGA ATAAGTGGTT TATAACTTGG GCTTGGCCTT TAAGTGTTTT TTACGTACTT
TTTGGTGTTG GCACTTGGCT GGTACTGATG AGTGGCTTTC ATACTTTCCA GACCATGATC
TTTCTGCTCA ACATCGTTAT GGCATTTTTA CTATTTGATT GGCCTTTCGT TGTTACTATG
GTTTTATCAG GAGTTATTAT AGCTACCAAT ATCTTTAAAC TATTCGCCTC AGCACCATCT
ACCTCCCATG ACTTTGGCAC ATTACAATTT AAAATCTTAT ATGGCCTATT GCTTTTAAGT
AGCTTCCTGT TAGCCATTTT CAAACATAAA CAAGCACAAA AGCAGCTAGC AAGTCGTAAT
GAGTATCTTA AACTTCTCCA AGCAGAACGC GATGAAAAGC TAAAAATTGC TTTAGAGTAT
CGAGAACGCT TTTCAAATGC TTTGTCTACT GATTGTGTAG AGGGATTCAT GTTGCTTTAT
CAAAGGGGAA AAGAGCTCAT AGAAGCTGCT AAACAAGTAC AAAGTGCAGA ACAAGCAAGA
GCTTTTATGG AAGATGTGTT AACGCTTCTA GCTAAGCAGC AGCAAGCAGG AGAATATTTG
GCAGAAACTA TCTATCGTTT TAAAGAACAT ATGCGTTTAG ATGTGCAGAA AGTAGATCTC
ACAGACTTTT TAGATGCTAC GCTAGAAAAT TTAGACATGA TAGATTTACA ACCGCGTCCT
AAGGTTACAG TTCAGCTACA AACTCAGCAA AGAACTATTC AGTGCGATCC TAAACTTATA
CAAAAGTTAC TTTATAATAG CCTACGTTCT ATTCAGGAGA AAAATCAAGA CAATAAGCCT
ATTAGATTGA TTGTAGAAGA TGCAAGTTTA GTATATGATA TTCCTTTTAT TCCTAACTAT
GCTAAAAAAT TGCCAGCTAT ACAATTTATG TTTACCACTG TTGATCAACT GTCAGCTAAG
AAAAATAGCT ATGAAGTAAT AGAACCTGTT AACATATTTT TGCCTAAGCA TATAGAAGAT
CTAGCTAGAG CTGAGAATGA GCAAATTATA GATGCACATT ATGGATCTGC AAGCTGGGAG
GCTGATATTA AAGGATTAAC ACAAATCTAC GTTATTCCTG TAGAGATACG TAGAATTAGG
CCAGCACTTA TGGATGAACC ACAAATGGTA TTAGATGGTA AGGGGCAAAC TACTTAA
 
Protein sequence
MLGLNIDMII FGLFLVINLS IGLFSSRRVT SLRDYAVGRK DFSTATLTAT IVVSWIAGMY 
VFEILEHTYR DGLYFIIVVS GASVCLWLVG LLAVRMREFL KNLSVAEAMG DLYGKSVQII
TAISGILSVI TLVAMEFKVI SKVICLIFAT ESVWVPVIAS VVVIIYSVSG GIRAITFTDV
IQFFTFGTFI PILALVIWNQ LKDPHQVIVL LNTNPNFSWS QVIGWHPKFI DTLFMLLWFT
IPAMDPVIFQ RISMAKDVAQ VKKSFNYAAL ISLVVCLFLA WVAILLLASN ANLEPDKLFN
YIINNYTTGG LRGLIGVGVL ALAMSTADSY LNASAVLLTN DIAKPLGLNF SKEKEVVVAR
SLCLLSGLFA LLIALQFKGI LALLQFANSL YMPVVTVPLL LAILGFRSTT LSVLISMGMG
FITVVGWPLI FKGSDSIIPG IVANLVGLLG SHYILKQPGG WVGVREPEPL LEARENRRKA
WKRFKKDFKE FSLVQYLQKT FPNQEYLLTI FGIYVIAATY ASFYTVPEEI QTNHARLYHI
IGQSVLFIST GLLTYPLWPP IFKNKWFITW AWPLSVFYVL FGVGTWLVLM SGFHTFQTMI
FLLNIVMAFL LFDWPFVVTM VLSGVIIATN IFKLFASAPS TSHDFGTLQF KILYGLLLLS
SFLLAIFKHK QAQKQLASRN EYLKLLQAER DEKLKIALEY RERFSNALST DCVEGFMLLY
QRGKELIEAA KQVQSAEQAR AFMEDVLTLL AKQQQAGEYL AETIYRFKEH MRLDVQKVDL
TDFLDATLEN LDMIDLQPRP KVTVQLQTQQ RTIQCDPKLI QKLLYNSLRS IQEKNQDNKP
IRLIVEDASL VYDIPFIPNY AKKLPAIQFM FTTVDQLSAK KNSYEVIEPV NIFLPKHIED
LARAENEQII DAHYGSASWE ADIKGLTQIY VIPVEIRRIR PALMDEPQMV LDGKGQTT
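
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It builds the codon table from the amino acid assignments of translation table 11, which are the same as those of the standard code (table 11 differs in which start codons are permitted). The helper names `clean`, `translate` and `gc_fraction` are hypothetical and written for this example. Only two fragments copied from the gene sequence above are used as input, not the full gene.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Drop the spaces and line breaks of the 10-base block layout."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in reading frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks and the final line of the gene sequence above.
first_blocks = "ATGTTAGGAC TCAACATAGA CATGATAATT"
last_line = "CCAGCACTTA TGGATGAACC ACAAATGGTA TTAGATGGTA AGGGGCAAAC TACTTAA"

print(translate(first_blocks))  # MLGLNIDMII
print(translate(last_line))     # PALMDEPQMVLDGKGQTT
```

The two outputs match the first ten and the last eighteen residues of the protein sequence listed above. The last line is in frame because the 47 preceding lines hold 60 bases each. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.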