Gene Aasi_1223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_1223 
Symbol 
ID6376909 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1562264 
End bp1563568 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content38% 
IMG OID642682319 
Producthypothetical protein 
Protein accessionYP_001958277 
Protein GI189502560 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000959128 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAACTTTA ATTCATTCCA GCAACTGATA GCACGTCTTC TACTTATAAG CTTATTCTTA 
CAAAGCTGTG GTGGAGGATT CGACAATAAC CCACTTATTC CTACCGGGGA AGAGCAAGTA
GCATCTATAC AAACTACTAC ACAAGCAATC CTTCCTCGAG CAGATATCCA GCCTTTGACA
GGTCAAGTAT TGACAGCAGA AGGTGGCCAT GCTGTTACTT TCTATAAGGA AGCAGGTGAG
TTAAAAGCTA ATGTAGCAAT GGACGTACCT GAAGGATTTA GTAAAACCTA TGAGGGAGTG
GAAGTATTAT TAGAGCAGGG AGCAGAGTTA TCGGACCTAC CTCGATTAAG TGAGCAAGCA
CAACAACGAC GTATTTATCT TCAACCAGCA CAAGGCAACC AGCCAGCTAA AGTAGTTATC
TATAAAGGAG TAGGATTGAT GGGAGGAGGG AGTAGTGAAG ACGAAGGAGA GGAAGAAGGA
ACATATCAAC TGGTGGTGGA GAGCGGAGAA AAGGAAGCCG AAGAAATTGA GCAAGAAAAA
GAAAAGCTAC AAATAATTAG ACATACTAAA AGGGGAGTTG CTGAAGCACA CTATCATTAT
AATTTATGGA GAGAGGATTT TTTAAAGCTA CAACCTTTAA CTTCTGTAAG AATACAGCCT
GAGAAAGTTC CAATGCAGGA ATTAATGAAG CTTTTTGAAA TTAAAGAAGG AGAGTTTTTA
GAGAAACAAG TGAAGGAAGT TTTGGAAGAT GCAGATGGTA TCATTCCAGA ACCAGAACCA
GAACCATTTA ATAAAAGACA TAAAATATAT GGACAGTTCA TTATTTGTGA GCAGACCCTA
CCAGCAGATG ACGGGGGATA CCCAGGTAAT CTTCCAGATT TTAAAAAACG AACGAGGGTT
GGATTTAATG GTGATGGGTT ACTTTATCAA AACAATAGTA TTTTAGATTC CTCTGAACAA
GCACTGTGGG CTTTAAATCC AAAAGGTAAG ATGTGCATCT TTTTTAGAGA TAGACATCCT
GATATTCCCA GTCAAGTACA TCACACTTTC TTTTTCAAAA CAAGTGGTAT TGGCAAACCT
GTTGCATGTA GTGGTATTAT TAGAGTTTGC AAAGGTAAAA TTGTGAGTAT TGATAATGAT
AGTGGTAGGT ATCAGCCAAG CGTTACTCAG TTGCTGTTAG CAGCAAAATA TCTATTTAAT
AAAGGTATTT TAGATCCTAC TATAAGCGTC AATGATGTAG TAAGAGATAA AAGTTTTACA
TTAAAAGAAA TGCTAATTTT TGCACATTCT CTTGACCTAA CCTAA
 
Protein sequence
MNFNSFQQLI ARLLLISLFL QSCGGGFDNN PLIPTGEEQV ASIQTTTQAI LPRADIQPLT 
GQVLTAEGGH AVTFYKEAGE LKANVAMDVP EGFSKTYEGV EVLLEQGAEL SDLPRLSEQA
QQRRIYLQPA QGNQPAKVVI YKGVGLMGGG SSEDEGEEEG TYQLVVESGE KEAEEIEQEK
EKLQIIRHTK RGVAEAHYHY NLWREDFLKL QPLTSVRIQP EKVPMQELMK LFEIKEGEFL
EKQVKEVLED ADGIIPEPEP EPFNKRHKIY GQFIICEQTL PADDGGYPGN LPDFKKRTRV
GFNGDGLLYQ NNSILDSSEQ ALWALNPKGK MCIFFRDRHP DIPSQVHHTF FFKTSGIGKP
VACSGIIRVC KGKIVSIDND SGRYQPSVTQ LLLAAKYLFN KGILDPTISV NDVVRDKSFT
LKEMLIFAHS LDLT