Gene Aasi_1041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_1041 
Symbol 
ID6377048 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1348633 
End bp1350090 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content34% 
IMG OID642682157 
Producthypothetical protein 
Protein accessionYP_001958118 
Protein GI189502401 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.880803 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTACT TTAATGTAGA TGCTATAATC GTATATGCTT TTTTGTTGCT GATTTTACTA 
CTAGGTTTAT GGGCAGGAAA AAATGTTAAG ACTATTAAAG AATATGCAAT TGCGAATAGA
GCATATGGGA CTGGTATATT AACAATAACT ATGCTAGCTA CCTTTATTAC AGGTTCACAA
TCTATAGGCT ATATAGGATA TATATATGAT GATGGAATTT TGCCTGTAAT TCCGATAATT
TTTTGCAGGG CTATAATTGG TTTCTTGTTT ATTGCACACT ATATATCCCC TAAAATTTTA
TATTTCGAGG GGTGCTTAAC ATTAGCAGAG GTTATGGGAA AACTATATGG TGGTATGGCC
CGGACCTGGA TAGGCTTTTT AGGTGTCCTT TATTGCTTGG CTTTTGTTAT ACTGCAAATT
ATCTGGATGG GATACATCGG AGAGCTTATT AATGTTCCCA ATCAGTGGGG TATGCTATTA
GGAGGCGCTT TTTTAATTAT CTATTCTGCC AGGGGTGGTA TGAAGTCAAT AACTATTACC
GATATATTAC AATTTATCTC TATTACTATG TTAGTAACTT TAGCAGTTAA TGTACTAATT
CATAAAATAG GAGGTATAGA TAATATATTC AATAAAGTTC CAACAAATAC TTTTAAAATC
TTTCAGAACC CTAACTTCAA GAATTATCTA GTTTATTGTT TATGGGGGGC ATTCCCTTCC
TATTTAGTCA GCTTCCCATT CATCCAGCGT ATGCTCATGG CTAAGGATAA AAGGCAGCTT
GCCAAGAGTC AATATATAGG AATGTCTTAC TTAACAATCT TCTATATGTC TCTTACTTTG
ATTGGTTTAG CTGCTATAGC ATTGAAAACA ATAGGAGATG TCAATATGCC TAAGCAAGGA
AGCAGAGTCT TTATATACTT GGTTGAGGCT TATTTTCCTG TGGGTATAAA GGGGATCATT
AGTATAGGCT TATTGGCTGC TGTTATGTCT ACAGCAGATT CTTTCTTGCA TAGTGCAGGT
ATTTTAATAG CTTATGATGT AGTACAACCT TTATTAGCAA AGAAATATGA GGTTAATGTT
TTAAGGACAA GCCAGTACGC AACATTTTTT CTTGGAGTAA TATCTTTAGG AGTAGCATTA
ATTTACGATA TACTGCCTCG TGTGCAATAT GGAACTATGG ATTTAGGAAA AGGAATAAAT
ATACTGAGAG ATTTTGTTGC TGTCGTGTTT ACCATTCCTC TTTTGGCCGG TATTATGGGC
CTTAAAACAG ATGCTAAATC GTTTTTTGTT TCTATGATCG CTACTTTTAT TGCTTTTTTT
ATAGGTAGGT TATTTTTGCC TGATTTGTGG TTTATGCCTA TGGTTATTGC AGTCAACAGT
ATCACATTTT TTGCTACTCA TTATATTCAG AATAAAGGAT TTGTAACTGT AAAACGTGGT
ACTGTTGTTT TATCTTAA
 
Protein sequence
MNYFNVDAII VYAFLLLILL LGLWAGKNVK TIKEYAIANR AYGTGILTIT MLATFITGSQ 
SIGYIGYIYD DGILPVIPII FCRAIIGFLF IAHYISPKIL YFEGCLTLAE VMGKLYGGMA
RTWIGFLGVL YCLAFVILQI IWMGYIGELI NVPNQWGMLL GGAFLIIYSA RGGMKSITIT
DILQFISITM LVTLAVNVLI HKIGGIDNIF NKVPTNTFKI FQNPNFKNYL VYCLWGAFPS
YLVSFPFIQR MLMAKDKRQL AKSQYIGMSY LTIFYMSLTL IGLAAIALKT IGDVNMPKQG
SRVFIYLVEA YFPVGIKGII SIGLLAAVMS TADSFLHSAG ILIAYDVVQP LLAKKYEVNV
LRTSQYATFF LGVISLGVAL IYDILPRVQY GTMDLGKGIN ILRDFVAVVF TIPLLAGIMG
LKTDAKSFFV SMIATFIAFF IGRLFLPDLW FMPMVIAVNS ITFFATHYIQ NKGFVTVKRG
TVVLS