Gene Aasi_1038 details


Gene Information

Locus tag: Aasi_1038
Symbol:
ID: 6376895
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Amoebophilus asiaticus 5a2
Kingdom: Bacteria
Replicon accession: NC_010830
Strand:
Start bp: 1343536
End bp: 1344999
Gene Length: 1464 bp
Protein Length: 487 aa
Translation table: 11
GC content: 34%
IMG OID: 642682154
Product: hypothetical protein
Protein accession: YP_001958115
Protein GI: 189502398
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG0591] Na+/proline symporter
TIGRFAM ID:
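
The length fields above follow from the coordinates, assuming that Start bp and End bp are 1-based and inclusive (an assumption, though the reported numbers agree with it). A minimal Python sketch of that arithmetic; the function names are illustrative and not part of any database API:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # the terminal stop codon encodes no residue


length = gene_length_bp(1343536, 1344999)
print(length)                     # 1464
print(protein_length_aa(length))  # 487
```

Both results match the Gene Length and Protein Length fields of the record.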


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATTATT TCAGTACTGA TGCCATCATT GTATATGCCT TTTTATTCAT AACCTTAGCA 
GTAGGATTAT GGGCAGGAAG AAATGTAAAG TCTATTAAAG AATATGCAAT TGCTAATAGG
ACTTATGGAA CAGGGGTACT TACGATAACC ATGTTGGCTA CTTATCTTAC TGGTTCTCAG
GCTATAGGTT ATGCAGGCCA TGTTTTTGAT AATGGTGTAT TTTTCCCTTT TATTACAAGA
GTTTTCTGTG GTGTTATTAT ATGCTTTTTG TTTATTGCAC GTTACATAGC GCCAAAAATG
TATCGTTTTG CAGGATGTTT GACATTGGCA GAAATAATGG GGAAACTATA TGGTCCTAAA
GTACGTATAT GGATCGGCAT TCTAGGAACT TTATATTCTC TAATCATGGT TACGCTACAA
ATTATCTGGT TGGGCTATAT AGGGGACTTC ATTAATATTC CTAGTCAGTG GAGTATTTTC
TTAGGAGGAG TTTTTTTAAT GTTTTATGCT AGCAGAGGAG GCATGAAAGC TGTAGCTATC
ACAGATATAT TACAATTTGT TGCCATTACT ATACTGATAC CTTTAGCCGC TAATGTCTTA
TTGCATAGAT TTGATGGAAT AAGAGATATG TTTACTCACG TTCCATCCGA AAATTTTAAT
TTCTTTCAAC ATCTGAATAT AAATGAATTC TTGATTCCTT TTTTATGGTA CCTCTTCCCT
GCTTTTCCAC TTAGCTTCCC ATTCATGCAA CGTATGCTCA TGGCAAAGGA CACGCGCCAA
ATAGCTAATA GCCATTATAT AGCTACATTT TATTTAATAG GGTTTTATCT ATTACTTACT
TTTATTGGTC TAGCAGCTAT AGCTTTAAAA ACAATGGGAG ATGTAAATAT TCCACACCAG
GGTAGTAAGA TATTTATATA TTTGGTTAAA ACATATTTTC CAGTAGGCAT CAAAGGCATA
GTAGGTATCG GATTGTTAGC AGCTGTTATG TCTACGGCAG ACTCTTTCCT GCATAGTGCA
GGTATGCTAG TAGCACATGA TGTAATTGGA CCTTTACTGC AAACAAAAAA AACTAAAATT
GATGTTTTAA AAATAAGTCA ATACGCAACA TTTTCTCTTG GCTTAATAGC TTTTTGCATA
GCATTAAGTT ATCAATCATT GCCTCGTATA CTGTATGGAG ATATACATTG GGGTAAAGGG
ATAAATATAT TTAGGGATTT TGTAGCGATC GTATTTACTA TTCCTATGAT AGCGGGTATC
ATGGGCTTAA AAACGGATGC TAAGTCTTTT TTTATTTCGT TAATAGCTAC TTGTATTACC
TTTTTTATAG GAAAATTATT TTTATCAGAT TTATGGTTTA TGCCTGTTAC CATTATAATT
AATGCGGTGA GCTTTTTTGG GGCGCATTAT CTACAAAATA AAGGGTTTTT AACTGTAAAA
AGAGATGAAA TATTAGTAAC TTAA
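
The GC content field can in principle be recomputed from a listing like the one above. The sketch below assumes the sequence is pasted as text in space-separated blocks of 10 bases; `gc_percent` is a hypothetical helper name, and the example uses only the first line of the gene sequence, not the whole gene:

```python
def gc_percent(blocked_sequence: str) -> float:
    """Percentage of G and C bases in a whitespace-blocked nucleotide listing."""
    seq = "".join(blocked_sequence.split()).upper()  # drop spaces and newlines
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence only (60 bases).
first_line = "ATGAATTATT TCAGTACTGA TGCCATCATT GTATATGCCT TTTTATTCAT AACCTTAGCA"
print(gc_percent(first_line))  # 30.0
```

Passing the full listing instead of `first_line` would give the figure to compare against the reported GC content.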
 
Protein sequence
MNYFSTDAII VYAFLFITLA VGLWAGRNVK SIKEYAIANR TYGTGVLTIT MLATYLTGSQ 
AIGYAGHVFD NGVFFPFITR VFCGVIICFL FIARYIAPKM YRFAGCLTLA EIMGKLYGPK
VRIWIGILGT LYSLIMVTLQ IIWLGYIGDF INIPSQWSIF LGGVFLMFYA SRGGMKAVAI
TDILQFVAIT ILIPLAANVL LHRFDGIRDM FTHVPSENFN FFQHLNINEF LIPFLWYLFP
AFPLSFPFMQ RMLMAKDTRQ IANSHYIATF YLIGFYLLLT FIGLAAIALK TMGDVNIPHQ
GSKIFIYLVK TYFPVGIKGI VGIGLLAAVM STADSFLHSA GMLVAHDVIG PLLQTKKTKI
DVLKISQYAT FSLGLIAFCI ALSYQSLPRI LYGDIHWGKG INIFRDFVAI VFTIPMIAGI
MGLKTDAKSF FISLIATCIT FFIGKLFLSD LWFMPVTIII NAVSFFGAHY LQNKGFLTVK
RDEILVT
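
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a standard-library illustration rather than the database's own procedure: it builds the codon map that NCBI translation table 11 shares with the standard code (the two tables differ in which start codons are permitted, not in these assignments). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); "*" marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str, to_stop: bool = True) -> str:
    """Translate a whitespace-blocked DNA listing, reading frame 1."""
    seq = "".join(blocked_dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*" and to_stop:
            break  # stop codon ends the protein
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence; both start on a codon boundary.
print(translate("ATGAATTATT TCAGTACTGA TGCCATCATT GTATATGCCT TTTTATTCAT AACCTTAGCA"))
# MNYFSTDAIIVYAFLFITLA
print(translate("AGAGATGAAA TATTAGTAAC TTAA"))
# RDEILVT
```

The two outputs match the first 20 and the last 7 residues of the protein sequence listed above.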