Gene Aasi_0697 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_0697 
Symbol 
ID6376888 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp887580 
End bp890441 
Gene Length2862 bp 
Protein Length953 aa 
Translation table11 
GC content37% 
IMG OID642681848 
Producthypothetical protein 
Protein accessionYP_001957815 
Protein GI189502098 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTAGGTT TGAATATAGA CATGTTAATT CTCGTCTTAT TTCTAGGTAT TAACCTAGTC 
ATAGGACTTT TCTCTAGCCG CCGAGTAACC TCTCTACAAG ATTATGCAAT AGGCAGAAAA
GATTTTTCTA CAGCTACCTT AACAGCTAGT ATTGTAGTTA GCTGGGTTGG TAGTTGGTAC
GTTTTTGAAA CGCTAGGGCA TACCTATACA GATGGACTAT ATTTTATCAT AGCTATCACT
GGTGCATGCA CCTGCTTGGT TATTGTTGGG TTATTAGCTG TACGCATGCA GGAGTTTTTA
AAAAACATCT CCGTCGCAGA AGCTATGGGA GGCATTTACG GTAAAACTGC ACAAACTATT
ACAGCTATTA GTGGGGTTTT AAGCGTATTG ACAATAGTAG CGATGGAATT TCATGTAATT
AGCCGAATCA TTAGCTTGAT ATTTAATACT GAAAGTATCT GGACGCCTGT AATTGCTGCT
GTCGTGGTAA TTGTGTACTC AGTTTCTGGA GGTATCCGTG CAGTTACTTT TACAGATGTA
ATACAGTTTT TTACATTTGG TACGTTTATT CCTATCTTGG CACTAGTCAT CTGGAATCAG
CTAAAAGACC CAAATCAGGT AATAACCTTG CTTAATACCC ATCCTAATTT TAGCTGGTCT
CACGTAATAG GATGGGATCC TAAATTTTTA GATGCTTTGG CAATGATGCT ATGGTTTCTC
ATCCCTGCTA TGGATCCTGT TATTTTTCAG CGGATTAGCA TGGCGCGGGA TATACAGCAA
ATAAAAGAAT CTTTTAGCTA TGCTGGTCTC ATTAGCTTAG TCGTGTGTTT GTTTTTAGCT
TGGATAGCTA TTTTATTATT AGCTAACAAT CCAAATTTGG AAGCTGACAA GCTTGTCGAA
CACATTATTT ATAATTACAC GTCAGCTGGC TTACGTGGGT TGATAGGCAT AGGTGTTTTA
GCTCTGGCTA TGTCTACGGC TGATTCTTAC CTGAATGCTT CTGCTGTTTT ATTAACCAAT
GATATTGCCA AGCCCTTGGG CATAAAATTC AAAAATGAAG TGCTGGCTGC CAGGGTTTTT
TGCGGCTTGT CGGGTATATT TGCCTTACTT ATTGCTTTGC AATTCAAAGG AATCTTGTCA
TTACTGCAAT TTGCCAATAG CCTGTATATG CCGGTTGTTA CAGTACCCTT GTTAATGGCT
ATTTTCGGAT TTAGAAGTAG CACATTAGCT GTATTAATAG GGATGGCTGG GGGGCTTGGA
ACGACACTCG GTTGGCCTTT TATTATTAAA GAAAGCCATG GTATTCTTCC AGGAATCATG
GCTAACTTAA TAGGATTACT AGGTAGCCAT TATATTCTCA AACAACCAGG TGGTTGGGTA
GGTATACGCG AACCAGAACC TTTGTTAGAA GCAAGAGAGA ATAGACGCAA AGCTTGGAAC
CAATTTAAAA AAGACTTTAA AGAGTTTAGC TTAGTACAGT ATCTGCAAAA GAGCTTACCT
AATCAAGATT ATCTATTGAC CATTTTTGGA ATCTATGTCA TTGCTGCTAC TTATGCTTCC
TTTTATACAG TACCAGAAGA GATACAAGCC AATTATGCCA AGCTTTATCA TATTATTGGT
CAAAGCGTAC TATTTATTAG CACAGGCTTG CTAACCTATC CCCTATGGCC TCCTATTTTT
AAGAACAAGT GGTTTATAAC CTGGGCCTGG CCTTTGAGTG TTTTTTATGT ACTCTTTGCA
GTGGGTACTT GGCTAGTACT GATGAGTGGC TTTCATACTT TCCAAACTAT GATCTTTCTT
CTGAATGTGG TGATGGGATT CTTATTACTT CCTTGGCATC TAGTAAGCAT TATGGTCATT
ATAGGTGTAA CAACTGCTAC TTATATTTTT AAAATATATG CACAAGTACC TATCCTTCCT
GACGACTTCG GTACCTTACG ATTCAAAATA TTATATGGTC TGCTACTAGC AAGTAATTTC
ATAGCTTTAT TTAAGTATCA GCAAGCACAA GGAAAGCTAG TAAGCCACAA TCAAGCCCTT
AATATTCTTC AAGCTAAGCG CACTATCAAC TTACGGGAAG CATTACAACA CCGGGAACGG
TTTATGCATA CTTTAGCTAT CAATTGTGTA GAAGGCTTTA ATTGGTTGTA CCAACAAAGC
AAAATACTTT GGACGTCTTT TAAACCAACA GAAATAACTT CATCATATAA GGACCTAATA
AATGAAGCGG TACTACTTTT AGCTAAACAG CAACAAGCTA GCGAATACTT AGCACAAACC
ATTTTTCCTT TTAAAAATTA TCTACGCTTA AATGTAGAGA AAGTTAACCT AGCAAATTTC
CTAAATACTG CGCTAGAAAA TTTAGATAAG ATAAATATAC AAGCCCAACC AAAAATTACT
TTACAACAGC TTACACACTA TCAAGAATTA GAAATAGATC CTGTACAAAT TCAAAAGCTG
TTATACAATA CTTTGCAAAC TATACAGGCA AAAAATCATG CCAATAAGCT TATTACCCTT
TTGGTAAAAG ATGCCACTTT AGTTTATGAG ATGCCATTTA TTCCTAACTA TAATAAAGAA
ATATCAGCTA TACAATTTAT ACTTACGACT ATTGAGCAAC CAGCAAAGGG TACAACTAAT
AACCATGATG CATTAGAAAC TGTCCATATA TTTTTACCCA AGCATATAGA AAATGTACCT
AGCGAAGAAA ACCAACGAAT TATAGAAGCT CATTATGGAT ATGCAAGTTG CGAAACTGAA
AATAAAGATA TCACCCAGAT CTACATCATT CCAGTGGCAC TAAGAAAAAT TCGGCCAACA
ATTATAGATG AAAGCAAAAA GGTATTAGAT GAAGTAGTGT AA
 
Protein sequence
MLGLNIDMLI LVLFLGINLV IGLFSSRRVT SLQDYAIGRK DFSTATLTAS IVVSWVGSWY 
VFETLGHTYT DGLYFIIAIT GACTCLVIVG LLAVRMQEFL KNISVAEAMG GIYGKTAQTI
TAISGVLSVL TIVAMEFHVI SRIISLIFNT ESIWTPVIAA VVVIVYSVSG GIRAVTFTDV
IQFFTFGTFI PILALVIWNQ LKDPNQVITL LNTHPNFSWS HVIGWDPKFL DALAMMLWFL
IPAMDPVIFQ RISMARDIQQ IKESFSYAGL ISLVVCLFLA WIAILLLANN PNLEADKLVE
HIIYNYTSAG LRGLIGIGVL ALAMSTADSY LNASAVLLTN DIAKPLGIKF KNEVLAARVF
CGLSGIFALL IALQFKGILS LLQFANSLYM PVVTVPLLMA IFGFRSSTLA VLIGMAGGLG
TTLGWPFIIK ESHGILPGIM ANLIGLLGSH YILKQPGGWV GIREPEPLLE ARENRRKAWN
QFKKDFKEFS LVQYLQKSLP NQDYLLTIFG IYVIAATYAS FYTVPEEIQA NYAKLYHIIG
QSVLFISTGL LTYPLWPPIF KNKWFITWAW PLSVFYVLFA VGTWLVLMSG FHTFQTMIFL
LNVVMGFLLL PWHLVSIMVI IGVTTATYIF KIYAQVPILP DDFGTLRFKI LYGLLLASNF
IALFKYQQAQ GKLVSHNQAL NILQAKRTIN LREALQHRER FMHTLAINCV EGFNWLYQQS
KILWTSFKPT EITSSYKDLI NEAVLLLAKQ QQASEYLAQT IFPFKNYLRL NVEKVNLANF
LNTALENLDK INIQAQPKIT LQQLTHYQEL EIDPVQIQKL LYNTLQTIQA KNHANKLITL
LVKDATLVYE MPFIPNYNKE ISAIQFILTT IEQPAKGTTN NHDALETVHI FLPKHIENVP
SEENQRIIEA HYGYASCETE NKDITQIYII PVALRKIRPT IIDESKKVLD EVV