Gene Aasi_1378 details

Gene Information

Locus tag: Aasi_1378
Symbol:
ID: 6377605
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Amoebophilus asiaticus 5a2
Kingdom: Bacteria
Replicon accession: NC_010830
Strand:
Start bp: 1754866
End bp: 1756293
Gene Length: 1428 bp
Protein Length: 475 aa
Translation table: 11
GC content: 35%
IMG OID: 642682452
Product: hypothetical protein
Protein accession: YP_001958407
Protein GI: 189502690
COG category:
COG ID:
TIGRFAM ID:
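
The coordinates and lengths in the table above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1754866
END_BP = 1756293
GENE_LENGTH_BP = 1428
PROTEIN_LENGTH_AA = 475


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_length_bp // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    # Both checks pass for the values listed in the table.
    assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    print("coordinates, gene length and protein length are consistent")
```

Under these assumptions the listed gene length (1428 bp) equals End bp - Start bp + 1, and 1428 / 3 - 1 gives the listed 475 aa.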


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATAAAT TTAGCTACAA CAATAATAAT ATTTTAAACC TTTTTATTGC TCCTACTAAC 
CTAACGGGAG AAAAACATGA AAGATTAAGT GAAGATCCTG CGCTAGCTAA TGAGAAGGAA
AGTAATTTTA GTAAAGCAAT TGCATGTTAC AAAAGAGCTG CTGAAGCAGG TCATTTAGAA
GCACAGCAAA AACTTATTAG CTCTAGAAAT ACAGGACAGT ACAGCAACCC ACATCAGGAA
GCAATGAAGT GGTATAAAGA TGCTATCAAC CGAGCTGCAA ATAGCTGGGG TAAAACAAGC
AGGGAAAATA AATTTTTGAA TTTTACAGAG TTTATACAAA AAGCCGATGC AACTGAAGCA
CCATTAAAAT TATCTGACCT AATCGTTATG GAGCTAGATG CTACCCACGT ACCGACCACG
ATAGAATGGG GAGGTATTTC TACAAGAAAT GGTAAAGCAA GTCATATTTT CGCACCAATC
TTTACGCTAC CTAATGTTAA AAAGTATGAT GATAAGGGGG TCATTGCTAT TGATGTTGCT
GGACCTGGGC TTGAAGCAGA TGAGATAGCT TATTGTGTAG CCAAACGAAG CGGCGAGTAT
TATTTTATCG TTGAATTAGG AAGCATAGTA AAGGATAGCG AAGATAAAAT AATAGAACAA
GTAGGATCAA TTATTAGAAG ACATAAGGAT GTGCTCTACA TCGTTTATGA AAGAAATGGT
GCTGGCTGTT ATTTTGGGAA ACCTGTAAAA GATGAACTTA ATAAATATAA TATAGAGGTA
ATAAAGCTAG AAAAACAAAA AAAGCGATCT AAAAGTTCGA ATATAAAAGA TGAAAAAGGA
GAAGAAGAGG AAGAAGATAC TTCAAAACAA GTAAAGGTAT TACTTGCAAT TAACCAAAAT
AGCAATAACA CAGATACTCA AAAGGGTGAA GGTAAAATAA CTAGGATTAT CTCTACTCTC
GAACCTTTAC TAAACTCACA TAAGCTTATT ATTGACCTAA AGATTCTCCA AGAAGATTTT
CAGAGAATAT CACAACCAGA AGTAGGAGAT TTAAATTGCA GCTTTTTTTA CCAATTACTA
ACAATAGGTG CAAAAGGTGC TCAATTCACC TCTCAAGGTT TTAGTAAACC CACACATGAT
GATCGTATAG ACGCACTAGC AATAGCAATA CAACATTTAA GAGATAGAAA AGAGAAGAAT
CAAACTGTGC AAACCTCACC AATACGTAAA AAAAGGTTAC CATTAAAAGC CATAAAAAAG
CATCAAAAAG CTGCTGAGCA AGGAGATATA AATGCATATT ACAAATTAGG AAAAATATAT
GAAAAGCAGG AAGTGTCGGA GCATTTTGTA ACTGCATTCA ACTATTATTC AATTGCTGCT
ATAAGAGGCT GTCTAAAAAG CCAAAATGGA GTCATAAAAG CCGCTTAA
 
Protein sequence
MDKFSYNNNN ILNLFIAPTN LTGEKHERLS EDPALANEKE SNFSKAIACY KRAAEAGHLE 
AQQKLISSRN TGQYSNPHQE AMKWYKDAIN RAANSWGKTS RENKFLNFTE FIQKADATEA
PLKLSDLIVM ELDATHVPTT IEWGGISTRN GKASHIFAPI FTLPNVKKYD DKGVIAIDVA
GPGLEADEIA YCVAKRSGEY YFIVELGSIV KDSEDKIIEQ VGSIIRRHKD VLYIVYERNG
AGCYFGKPVK DELNKYNIEV IKLEKQKKRS KSSNIKDEKG EEEEEDTSKQ VKVLLAINQN
SNNTDTQKGE GKITRIISTL EPLLNSHKLI IDLKILQEDF QRISQPEVGD LNCSFFYQLL
TIGAKGAQFT SQGFSKPTHD DRIDALAIAI QHLRDRKEKN QTVQTSPIRK KRLPLKAIKK
HQKAAEQGDI NAYYKLGKIY EKQEVSEHFV TAFNYYSIAA IRGCLKSQNG VIKAA
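
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before the data can be used. The sketch below shows one way to do that in plain Python and to translate the gene sequence. It assumes that the sense codons of translation table 11 match the standard genetic code (the tables differ mainly in which start codons are permitted); the helper names `join_blocks`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
# Assumption: sense-codon assignments for table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    fragment = join_blocks("ATGGATAAAT TTAGCTACAA CAATAATAAT")
    print(translate(fragment))  # MDKFSYNNNN, the first block of the protein
    print(round(gc_fraction(fragment), 2))
```

Applied to the full gene sequence, `len(join_blocks(...))` can be compared with the listed gene length and `translate(...)` with the listed protein sequence.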