Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aasi_0697 |
Symbol | |
ID | 6376888 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | - |
Start bp | 887580 |
End bp | 890441 |
Gene Length | 2862 bp |
Protein Length | 953 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 642681848 |
Product | hypothetical protein |
Protein accession | YP_001957815 |
Protein GI | 189502098 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0591] Na+/proline symporter |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTAGGTT TGAATATAGA CATGTTAATT CTCGTCTTAT TTCTAGGTAT TAACCTAGTC ATAGGACTTT TCTCTAGCCG CCGAGTAACC TCTCTACAAG ATTATGCAAT AGGCAGAAAA GATTTTTCTA CAGCTACCTT AACAGCTAGT ATTGTAGTTA GCTGGGTTGG TAGTTGGTAC GTTTTTGAAA CGCTAGGGCA TACCTATACA GATGGACTAT ATTTTATCAT AGCTATCACT GGTGCATGCA CCTGCTTGGT TATTGTTGGG TTATTAGCTG TACGCATGCA GGAGTTTTTA AAAAACATCT CCGTCGCAGA AGCTATGGGA GGCATTTACG GTAAAACTGC ACAAACTATT ACAGCTATTA GTGGGGTTTT AAGCGTATTG ACAATAGTAG CGATGGAATT TCATGTAATT AGCCGAATCA TTAGCTTGAT ATTTAATACT GAAAGTATCT GGACGCCTGT AATTGCTGCT GTCGTGGTAA TTGTGTACTC AGTTTCTGGA GGTATCCGTG CAGTTACTTT TACAGATGTA ATACAGTTTT TTACATTTGG TACGTTTATT CCTATCTTGG CACTAGTCAT CTGGAATCAG CTAAAAGACC CAAATCAGGT AATAACCTTG CTTAATACCC ATCCTAATTT TAGCTGGTCT CACGTAATAG GATGGGATCC TAAATTTTTA GATGCTTTGG CAATGATGCT ATGGTTTCTC ATCCCTGCTA TGGATCCTGT TATTTTTCAG CGGATTAGCA TGGCGCGGGA TATACAGCAA ATAAAAGAAT CTTTTAGCTA TGCTGGTCTC ATTAGCTTAG TCGTGTGTTT GTTTTTAGCT TGGATAGCTA TTTTATTATT AGCTAACAAT CCAAATTTGG AAGCTGACAA GCTTGTCGAA CACATTATTT ATAATTACAC GTCAGCTGGC TTACGTGGGT TGATAGGCAT AGGTGTTTTA GCTCTGGCTA TGTCTACGGC TGATTCTTAC CTGAATGCTT CTGCTGTTTT ATTAACCAAT GATATTGCCA AGCCCTTGGG CATAAAATTC AAAAATGAAG TGCTGGCTGC CAGGGTTTTT TGCGGCTTGT CGGGTATATT TGCCTTACTT ATTGCTTTGC AATTCAAAGG AATCTTGTCA TTACTGCAAT TTGCCAATAG CCTGTATATG CCGGTTGTTA CAGTACCCTT GTTAATGGCT ATTTTCGGAT TTAGAAGTAG CACATTAGCT GTATTAATAG GGATGGCTGG GGGGCTTGGA ACGACACTCG GTTGGCCTTT TATTATTAAA GAAAGCCATG GTATTCTTCC AGGAATCATG GCTAACTTAA TAGGATTACT AGGTAGCCAT TATATTCTCA AACAACCAGG TGGTTGGGTA GGTATACGCG AACCAGAACC TTTGTTAGAA GCAAGAGAGA ATAGACGCAA AGCTTGGAAC CAATTTAAAA AAGACTTTAA AGAGTTTAGC TTAGTACAGT ATCTGCAAAA GAGCTTACCT AATCAAGATT ATCTATTGAC CATTTTTGGA ATCTATGTCA TTGCTGCTAC TTATGCTTCC TTTTATACAG TACCAGAAGA GATACAAGCC AATTATGCCA AGCTTTATCA TATTATTGGT CAAAGCGTAC TATTTATTAG CACAGGCTTG CTAACCTATC CCCTATGGCC TCCTATTTTT AAGAACAAGT GGTTTATAAC CTGGGCCTGG CCTTTGAGTG TTTTTTATGT ACTCTTTGCA GTGGGTACTT GGCTAGTACT GATGAGTGGC TTTCATACTT TCCAAACTAT GATCTTTCTT CTGAATGTGG TGATGGGATT CTTATTACTT CCTTGGCATC TAGTAAGCAT TATGGTCATT ATAGGTGTAA CAACTGCTAC TTATATTTTT AAAATATATG CACAAGTACC TATCCTTCCT GACGACTTCG GTACCTTACG ATTCAAAATA TTATATGGTC TGCTACTAGC AAGTAATTTC ATAGCTTTAT TTAAGTATCA GCAAGCACAA GGAAAGCTAG TAAGCCACAA TCAAGCCCTT AATATTCTTC AAGCTAAGCG CACTATCAAC TTACGGGAAG CATTACAACA CCGGGAACGG TTTATGCATA CTTTAGCTAT CAATTGTGTA GAAGGCTTTA ATTGGTTGTA CCAACAAAGC AAAATACTTT GGACGTCTTT TAAACCAACA GAAATAACTT CATCATATAA GGACCTAATA AATGAAGCGG TACTACTTTT AGCTAAACAG CAACAAGCTA GCGAATACTT AGCACAAACC ATTTTTCCTT TTAAAAATTA TCTACGCTTA AATGTAGAGA AAGTTAACCT AGCAAATTTC CTAAATACTG CGCTAGAAAA TTTAGATAAG ATAAATATAC AAGCCCAACC AAAAATTACT TTACAACAGC TTACACACTA TCAAGAATTA GAAATAGATC CTGTACAAAT TCAAAAGCTG TTATACAATA CTTTGCAAAC TATACAGGCA AAAAATCATG CCAATAAGCT TATTACCCTT TTGGTAAAAG ATGCCACTTT AGTTTATGAG ATGCCATTTA TTCCTAACTA TAATAAAGAA ATATCAGCTA TACAATTTAT ACTTACGACT ATTGAGCAAC CAGCAAAGGG TACAACTAAT AACCATGATG CATTAGAAAC TGTCCATATA TTTTTACCCA AGCATATAGA AAATGTACCT AGCGAAGAAA ACCAACGAAT TATAGAAGCT CATTATGGAT ATGCAAGTTG CGAAACTGAA AATAAAGATA TCACCCAGAT CTACATCATT CCAGTGGCAC TAAGAAAAAT TCGGCCAACA ATTATAGATG AAAGCAAAAA GGTATTAGAT GAAGTAGTGT AA
|
Protein sequence | MLGLNIDMLI LVLFLGINLV IGLFSSRRVT SLQDYAIGRK DFSTATLTAS IVVSWVGSWY VFETLGHTYT DGLYFIIAIT GACTCLVIVG LLAVRMQEFL KNISVAEAMG GIYGKTAQTI TAISGVLSVL TIVAMEFHVI SRIISLIFNT ESIWTPVIAA VVVIVYSVSG GIRAVTFTDV IQFFTFGTFI PILALVIWNQ LKDPNQVITL LNTHPNFSWS HVIGWDPKFL DALAMMLWFL IPAMDPVIFQ RISMARDIQQ IKESFSYAGL ISLVVCLFLA WIAILLLANN PNLEADKLVE HIIYNYTSAG LRGLIGIGVL ALAMSTADSY LNASAVLLTN DIAKPLGIKF KNEVLAARVF CGLSGIFALL IALQFKGILS LLQFANSLYM PVVTVPLLMA IFGFRSSTLA VLIGMAGGLG TTLGWPFIIK ESHGILPGIM ANLIGLLGSH YILKQPGGWV GIREPEPLLE ARENRRKAWN QFKKDFKEFS LVQYLQKSLP NQDYLLTIFG IYVIAATYAS FYTVPEEIQA NYAKLYHIIG QSVLFISTGL LTYPLWPPIF KNKWFITWAW PLSVFYVLFA VGTWLVLMSG FHTFQTMIFL LNVVMGFLLL PWHLVSIMVI IGVTTATYIF KIYAQVPILP DDFGTLRFKI LYGLLLASNF IALFKYQQAQ GKLVSHNQAL NILQAKRTIN LREALQHRER FMHTLAINCV EGFNWLYQQS KILWTSFKPT EITSSYKDLI NEAVLLLAKQ QQASEYLAQT IFPFKNYLRL NVEKVNLANF LNTALENLDK INIQAQPKIT LQQLTHYQEL EIDPVQIQKL LYNTLQTIQA KNHANKLITL LVKDATLVYE MPFIPNYNKE ISAIQFILTT IEQPAKGTTN NHDALETVHI FLPKHIENVP SEENQRIIEA HYGYASCETE NKDITQIYII PVALRKIRPT IIDESKKVLD EVV
|
| |