Gene Aasi_1479 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_1479 
Symbol 
ID6376515 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp109263 
End bp110321 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content33% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003572977 
Protein GI294661102 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGCTAT TTACACAACC AACTTTATTC ATTGTGTTAG TGTGCTTAAT AATATTAGCA 
GCTTGTTCGG ATTGTGATAA TAATGAAACT GAAATCCAGA TAAAAAAACC ACGAACAGTA
TCAGCAGACG AAGTTCGTGT CAAAGCACAG GCCATAGGTA TTATAATAAA TAGCAAAGGG
GATATTGTAC CCCCTTATCC TCCCAAAGCA GGTACAGAAG GCGAGTATAT TTTACATTAT
GCTATCTTTG AAGGAGCAGA TATCCAACTT ATAAAATACT TATTAGAAAC ATTAAACCAA
AAATACCTTG ATAATAAGTT TAAGGGATTA CTTAGTACAC GCGATTTAGA GCAACATACT
CCTTTTATGC GTGCTGTAAA CGGTGGAGAT ACATCAATTA TACAACTTTT AATAGATTAT
GGGTTACGCC CTACAAAAAA CGAAATAGAC TATGCAATTA ATCATAGTTT ATCACAAGAT
AATGCTGAGT TGCTAGCTTA TATATTAAGA CTAGCCACAA AAAACAACGC ACACAGTCAG
GTAAAATATC TTACTGATAT ATTAGGTTCT GTAGCAGAAC AAGGAAAAGC TAACATAGTA
GAGTATATCT TAAAGATGCC AAATGTTGAT ATCTATACTA AGGATGACAA AGGGAATACA
CCTCTCCATA ATGCCATTAA AGCTAAGAGT GAAAAGGTTG TAAGATCATT ATTGAAAAAA
GGAACTAACA ATATAAATAT AAAGAATAAG GAAGGGAAAA CTCCTCTTGT TTTAGCTATT
GAGAATGGTT CACAACCTAT TGTGAGAACT TTGATAAATT TTGGGGCACA GCTCACAAAG
CCTGAACATC CAGAGGACTT TCCTCTCGAG TATAGATTAG CCAGAGAAGA CTTAATAGTA
CAAAAAACAC AAAATGGCAA AGAAGATAAT AATAAGAAAG GTATAGTAAA TATTTTACAG
GGGAGATTTG GTATAGCAGA TAAAAAACTC AAAGAAATTT TAGAGAAGAA ATTAGAGAAA
AAAGAAGTTA CACATTCAAG TCTAGACGAT ATAGATTAG
 
Protein sequence
MRLFTQPTLF IVLVCLIILA ACSDCDNNET EIQIKKPRTV SADEVRVKAQ AIGIIINSKG 
DIVPPYPPKA GTEGEYILHY AIFEGADIQL IKYLLETLNQ KYLDNKFKGL LSTRDLEQHT
PFMRAVNGGD TSIIQLLIDY GLRPTKNEID YAINHSLSQD NAELLAYILR LATKNNAHSQ
VKYLTDILGS VAEQGKANIV EYILKMPNVD IYTKDDKGNT PLHNAIKAKS EKVVRSLLKK
GTNNINIKNK EGKTPLVLAI ENGSQPIVRT LINFGAQLTK PEHPEDFPLE YRLAREDLIV
QKTQNGKEDN NKKGIVNILQ GRFGIADKKL KEILEKKLEK KEVTHSSLDD ID