Gene Aasi_1862 details


Gene Information

Locus tag: Aasi_1862
Symbol:
ID: 6377250
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Amoebophilus asiaticus 5a2
Kingdom: Bacteria
Replicon accession: NC_010830
Strand:
Start bp: 1548988
End bp: 1550079
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 35%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003573228
Protein GI: 294661352
COG category:
COG ID:
TIGRFAM ID:
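
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon excluded from the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1548988
end_bp = 1550079

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# in the reported protein length.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1092
print(protein_length_aa)  # 363
```

Under these assumptions the computed values agree with the reported Gene Length (1092 bp) and Protein Length (363 aa).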


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000036764
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAGCTTA AGTACTCGCT GTTTGAGAAG TTCGCAGCAT ATATACTACT TGTTATAAGC 
CTGTTTTTAC AGAGCTGCAA TAGCCCTAAC GTTCAACCTA TTGCACCACA ATCACCTAAC
ATTTCTACGG AAGAGCAGAT GATTTCTTTT TACCAAGAAG CAGGCCAATT AAAAGCAATT
GTTAAGCAGG GCCACGGCAG CTTAAGTGCA TCCTATACCT TACCAGTGTA TATATCTTCT
GATGTGAGCT TAGCACAACT AGTTACCTTA ACCCAAGAGG GGCAAAAACA ACTGATCCAT
ATAAACTTAC CTAAGGAGGG GCAGCATGGT TATGTATGTG TGGGGCATGG AGATTTAAGA
GATAAAGAGG ATAAGCAAGA ATATGCTGCT ATAGTAGGAG ATAGTAACCA AAAGGAAAAG
GAAAAAGAAG GACAAGTATC TGAAAAAGTT GCCTATAGAA AGATTTCTAC AGAAGATGCT
GAAGGGGAAG AAAAATTTTT AAAAGTGCAA GAGTTAGAAG ATCGAATTAT TCAGCATATT
AATCGAGCGA AAGATGAAGC GATCATCAAG AGTATAAATT ATACACTAGA AAGTTTTAGT
CCTAATGGGA AATGGAGTGC AAAAGTAATT TTGGATGCCT TTAGAGAAAA GAAAGAAATT
TTAGATTTTG GGCATGTGGC TTTTCATGAA CGTGATTTTG AATTAATTTC AAACCATCCT
TTTTTTAGAG AAAAGGCAAA AGGGATTAAA TTTAGTAATC TGGATAATTT AGCAATTGGA
GGAGGACGAG GTAAATTTAT CAGGTCCTTG GCAACAAGCT TACAATCTAC CAATATTATA
GAAGTTGAAT TAAGCAATAG TAACCTGAAG GCTGAAAGTA TATCCTTATT TGTTACGAAT
TTAAAAGGTG CCCAAGTAAA GAGGGTCAAC TTTAAATACA ACCAAGTAGG CAATGAAATA
CAACAATGGC TTTCTAGTAA TTACTTGGCA AATACCACTA TAGAAGAAAT TAATTTAAAA
CATAATAAGG TAGAAACAAG TATACAGCAA TTGTTAATTG AAAAGTATAA TCACATTCAC
TGGATATTTT AG
 
Protein sequence
MKLKYSLFEK FAAYILLVIS LFLQSCNSPN VQPIAPQSPN ISTEEQMISF YQEAGQLKAI 
VKQGHGSLSA SYTLPVYISS DVSLAQLVTL TQEGQKQLIH INLPKEGQHG YVCVGHGDLR
DKEDKQEYAA IVGDSNQKEK EKEGQVSEKV AYRKISTEDA EGEEKFLKVQ ELEDRIIQHI
NRAKDEAIIK SINYTLESFS PNGKWSAKVI LDAFREKKEI LDFGHVAFHE RDFELISNHP
FFREKAKGIK FSNLDNLAIG GGRGKFIRSL ATSLQSTNII EVELSNSNLK AESISLFVTN
LKGAQVKRVN FKYNQVGNEI QQWLSSNYLA NTTIEEINLK HNKVETSIQQ LLIEKYNHIH
WIF
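
The protein sequence can be reproduced from the gene sequence with the amino acid assignments of translation table 11. The sketch below is one way to do that in plain Python, and it also shows how a GC percentage could be computed. The helper names (clean, gc_percent, translate) are hypothetical and not part of the record. It assumes the sequence is given on the coding strand, starts with ATG, and contains only the letters A, C, G and T once whitespace is removed.

```python
import itertools

# Amino acid assignments for NCBI translation table 11, listed by codon
# with the bases ordered T, C, A, G at each position ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(sequence):
    """Remove the spaces and line breaks used for display and uppercase the bases."""
    return "".join(sequence.split()).upper()


def gc_percent(sequence):
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(sequence)
    if not dna:
        return 0.0
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(sequence):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(sequence)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, in the 10-base display blocks used above.
print(translate("ATGAAGCTTA AGTACTCGCT GTTTGAGAAG"))  # MKLKYSLFEK
```

Passing the full gene sequence to translate and gc_percent in the same way allows the listed protein sequence and GC content to be compared against the record.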