Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aasi_0694 |
Symbol | |
ID | 6376847 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | - |
Start bp | 883230 |
End bp | 886106 |
Gene Length | 2877 bp |
Protein Length | 958 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 642681846 |
Product | hypothetical protein |
Protein accession | YP_001957813 |
Protein GI | 189502096 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0591] Na+/proline symporter |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTAGGAC TCAACATAGA CATGATAATT TTCGGCTTAT TTTTGGTTAT TAACCTATCA ATAGGCCTCT TTTCTAGCCG TCGAGTAACT TCTCTACGTG ATTATGCAGT AGGCAGAAAA GACTTTTCTA CAGCCACATT AACAGCTACT ATTGTAGTCA GCTGGATTGC AGGTATGTAT GTGTTTGAAA TATTAGAACA TACATACAGA GACGGTCTTT ATTTCATTAT AGTAGTAAGT GGTGCTTCTG TCTGCTTGTG GTTAGTGGGA TTATTAGCAG TGCGAATGCG AGAATTTTTA AAAAACCTTT CTGTAGCAGA AGCTATGGGA GATTTGTATG GAAAATCTGT ACAAATTATT ACAGCTATTA GTGGTATATT AAGTGTGATA ACTTTGGTAG CTATGGAATT TAAAGTAATT AGCAAAGTAA TCTGCTTAAT TTTTGCTACA GAAAGTGTAT GGGTGCCCGT AATAGCGTCT GTAGTGGTCA TTATATACTC AGTCTCAGGT GGTATACGGG CCATTACGTT TACTGATGTA ATACAATTTT TTACGTTTGG CACTTTTATT CCAATCCTCG CATTAGTTAT TTGGAATCAA CTCAAAGACC CACATCAAGT TATAGTCTTA CTTAATACCA ACCCTAATTT TAGCTGGTCA CAAGTAATTG GTTGGCATCC TAAATTCATC GATACGCTAT TCATGTTATT ATGGTTTACA ATTCCTGCTA TGGATCCTGT CATATTTCAA CGTATCAGTA TGGCAAAAGA TGTAGCACAA GTAAAAAAAT CCTTTAATTA TGCAGCTCTC ATCAGCTTAG TAGTTTGTTT GTTTTTAGCT TGGGTAGCTA TTTTGTTATT GGCTAGCAAT GCAAATTTAG AACCTGACAA GCTTTTTAAT TACATTATCA ATAATTATAC AACAGGCGGG TTACGGGGTC TAATTGGTGT AGGTGTGTTA GCCCTTGCTA TGTCCACGGC GGATTCTTAT CTCAATGCTT CTGCTGTTTT GTTAACCAAT GATATTGCGA AGCCTTTAGG ACTTAACTTT AGCAAGGAAA AGGAGGTAGT TGTTGCGAGA TCCCTTTGTT TACTCTCAGG GCTTTTTGCT TTGCTGATTG CCTTACAGTT TAAAGGTATA TTAGCTTTAC TGCAATTTGC CAATAGCCTG TATATGCCAG TTGTTACAGT ACCTTTATTA CTAGCTATTT TAGGTTTTAG GAGTACTACA TTGTCCGTAC TTATAAGCAT GGGTATGGGG TTCATTACCG TAGTAGGCTG GCCTTTAATT TTTAAGGGAA GCGATAGTAT TATTCCTGGT ATTGTCGCTA ATCTAGTAGG TTTACTTGGT AGCCATTATA TCCTTAAACA ACCAGGCGGC TGGGTAGGCG TACGGGAACC AGAACCTTTG TTAGAGGCAA GAGAAAATAG ACGAAAAGCT TGGAAAAGGT TTAAAAAAGA TTTTAAAGAA TTTAGCTTAG TACAATACTT ACAAAAAACT TTTCCTAATC AAGAATACCT ATTAACCATT TTTGGAATCT ATGTAATTGC TGCTACTTAT GCTTCCTTTT ATACAGTGCC CGAAGAAATA CAAACCAACC ACGCTAGGCT TTACCACATC ATTGGTCAAA GCGTACTCTT TATTAGTACA GGCTTATTAA CCTACCCCTT ATGGCCTCCT ATTTTTAAGA ATAAGTGGTT TATAACTTGG GCTTGGCCTT TAAGTGTTTT TTACGTACTT TTTGGTGTTG GCACTTGGCT GGTACTGATG AGTGGCTTTC ATACTTTCCA GACCATGATC TTTCTGCTCA ACATCGTTAT GGCATTTTTA CTATTTGATT GGCCTTTCGT TGTTACTATG GTTTTATCAG GAGTTATTAT AGCTACCAAT ATCTTTAAAC TATTCGCCTC AGCACCATCT ACCTCCCATG ACTTTGGCAC ATTACAATTT AAAATCTTAT ATGGCCTATT GCTTTTAAGT AGCTTCCTGT TAGCCATTTT CAAACATAAA CAAGCACAAA AGCAGCTAGC AAGTCGTAAT GAGTATCTTA AACTTCTCCA AGCAGAACGC GATGAAAAGC TAAAAATTGC TTTAGAGTAT CGAGAACGCT TTTCAAATGC TTTGTCTACT GATTGTGTAG AGGGATTCAT GTTGCTTTAT CAAAGGGGAA AAGAGCTCAT AGAAGCTGCT AAACAAGTAC AAAGTGCAGA ACAAGCAAGA GCTTTTATGG AAGATGTGTT AACGCTTCTA GCTAAGCAGC AGCAAGCAGG AGAATATTTG GCAGAAACTA TCTATCGTTT TAAAGAACAT ATGCGTTTAG ATGTGCAGAA AGTAGATCTC ACAGACTTTT TAGATGCTAC GCTAGAAAAT TTAGACATGA TAGATTTACA ACCGCGTCCT AAGGTTACAG TTCAGCTACA AACTCAGCAA AGAACTATTC AGTGCGATCC TAAACTTATA CAAAAGTTAC TTTATAATAG CCTACGTTCT ATTCAGGAGA AAAATCAAGA CAATAAGCCT ATTAGATTGA TTGTAGAAGA TGCAAGTTTA GTATATGATA TTCCTTTTAT TCCTAACTAT GCTAAAAAAT TGCCAGCTAT ACAATTTATG TTTACCACTG TTGATCAACT GTCAGCTAAG AAAAATAGCT ATGAAGTAAT AGAACCTGTT AACATATTTT TGCCTAAGCA TATAGAAGAT CTAGCTAGAG CTGAGAATGA GCAAATTATA GATGCACATT ATGGATCTGC AAGCTGGGAG GCTGATATTA AAGGATTAAC ACAAATCTAC GTTATTCCTG TAGAGATACG TAGAATTAGG CCAGCACTTA TGGATGAACC ACAAATGGTA TTAGATGGTA AGGGGCAAAC TACTTAA
|
Protein sequence | MLGLNIDMII FGLFLVINLS IGLFSSRRVT SLRDYAVGRK DFSTATLTAT IVVSWIAGMY VFEILEHTYR DGLYFIIVVS GASVCLWLVG LLAVRMREFL KNLSVAEAMG DLYGKSVQII TAISGILSVI TLVAMEFKVI SKVICLIFAT ESVWVPVIAS VVVIIYSVSG GIRAITFTDV IQFFTFGTFI PILALVIWNQ LKDPHQVIVL LNTNPNFSWS QVIGWHPKFI DTLFMLLWFT IPAMDPVIFQ RISMAKDVAQ VKKSFNYAAL ISLVVCLFLA WVAILLLASN ANLEPDKLFN YIINNYTTGG LRGLIGVGVL ALAMSTADSY LNASAVLLTN DIAKPLGLNF SKEKEVVVAR SLCLLSGLFA LLIALQFKGI LALLQFANSL YMPVVTVPLL LAILGFRSTT LSVLISMGMG FITVVGWPLI FKGSDSIIPG IVANLVGLLG SHYILKQPGG WVGVREPEPL LEARENRRKA WKRFKKDFKE FSLVQYLQKT FPNQEYLLTI FGIYVIAATY ASFYTVPEEI QTNHARLYHI IGQSVLFIST GLLTYPLWPP IFKNKWFITW AWPLSVFYVL FGVGTWLVLM SGFHTFQTMI FLLNIVMAFL LFDWPFVVTM VLSGVIIATN IFKLFASAPS TSHDFGTLQF KILYGLLLLS SFLLAIFKHK QAQKQLASRN EYLKLLQAER DEKLKIALEY RERFSNALST DCVEGFMLLY QRGKELIEAA KQVQSAEQAR AFMEDVLTLL AKQQQAGEYL AETIYRFKEH MRLDVQKVDL TDFLDATLEN LDMIDLQPRP KVTVQLQTQQ RTIQCDPKLI QKLLYNSLRS IQEKNQDNKP IRLIVEDASL VYDIPFIPNY AKKLPAIQFM FTTVDQLSAK KNSYEVIEPV NIFLPKHIED LARAENEQII DAHYGSASWE ADIKGLTQIY VIPVEIRRIR PALMDEPQMV LDGKGQTT
|
| |