Gene Aasi_1031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_1031 
Symbol 
ID6376993 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1337400 
End bp1338647 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content37% 
IMG OID642682147 
Producthypothetical protein 
Protein accessionYP_001958108 
Protein GI189502391 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1108] ABC-type Mn2+/Zn2+ transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGCAT TTCTTGAATT TTTTACTTTA ACAGATCCAA ATATCCGGTA TGTAGTATTG 
GGAACCATGC TACTTACTAG TAGCGCAGCT ATAGTAGGAA CTTTTACTTT GCTTAAAAAG
AAGGCTTTAC TGGGAGATGC TACAGCACAT GCTGCCTTCC CGGGTATATG TATAGCTTTT
ATGCTTACTG GTACTAAACA TCCAATTTAC TTGGCTATAG GTGCTTTTAC AACTGGTTGG
ATAGCTTTAT TACTCATAGA TATACTAATC AAGCATTCTA AGATTAAGGA AGATATTGCT
ACAGCCTTGT TGCTTTCTGT AACGTTTGGG ATAGGCACCC TACTACTTTC TATTATCCAA
AATACGTCTA ATGCAGCACA ACTAGGGCTT AACAACTACC TATTTGGCAA AGCAGCTGCC
TTGCTAGGCG AAGACTTATT GGTATTAGGT ATACTTAGTA TAGCTATTTT ACTAACAGTT
GTCGGTCTTT TTAAAGAATT TACCTTGATT GCTTTTGATA AAGCCTTTGC TGAATCAATA
CATATGCGGG TAAACTTGCT AGAATTTGTA TTTACTAGCC TTATTGTATT AGCCATTGTA
ATAGGTATCC GTGCAGTAGG TATTGTGTTA ATGGCAGCCA TGCTTATTAC TCCAGCCGCA
GCTGCTCGCT TTTGGACGGA CCGGCTAACA AAAATGATAG GCTTAGCAGC ACTATTTGGA
GCTATATCAG GTCTATTAGG CAGCTTCGTT TCTTACCTTT TTCCTGGAAT GCCTACAGGC
CCTTGGATTG TACTTATTGT TACTGCTATA GCATATATTT CTTTTCTTTT TGCTCCCCAT
AAAGGACTGC TGGCAAAAAA GTTACGTCAG TATAAGCATC AAAATAAGAT ATTACAAGAA
AACATATTAA AGCTATTTTA TGAGATAGGG GAAGAGAAAG GAGATTTTTT TGGAAATTGC
TCTACTGAAG AGCTTATGCA GCACAGAGCT ATACCCCACA ATAAACTTAT AAGGGGGCTT
GTATCCTTAG AGAAAGCTAG CTTACTATAT TATAAAGGAA ATAAATGGCG CCTGACAGGG
GCTGGAAAAA ATGAGGGAGA ACGGATAGCA CAGCGACATC GGTTATGGGA ATTATACCTA
ACTAAATACT TAAAGACCAA ACCGGCTTAT ATTCATGAAA ATGCTGAACT CATAGAACAT
GTACTTACAC CTGAATTAGT TAAAGAATTA AATAAATTGC TAAACTAA
 
Protein sequence
MKAFLEFFTL TDPNIRYVVL GTMLLTSSAA IVGTFTLLKK KALLGDATAH AAFPGICIAF 
MLTGTKHPIY LAIGAFTTGW IALLLIDILI KHSKIKEDIA TALLLSVTFG IGTLLLSIIQ
NTSNAAQLGL NNYLFGKAAA LLGEDLLVLG ILSIAILLTV VGLFKEFTLI AFDKAFAESI
HMRVNLLEFV FTSLIVLAIV IGIRAVGIVL MAAMLITPAA AARFWTDRLT KMIGLAALFG
AISGLLGSFV SYLFPGMPTG PWIVLIVTAI AYISFLFAPH KGLLAKKLRQ YKHQNKILQE
NILKLFYEIG EEKGDFFGNC STEELMQHRA IPHNKLIRGL VSLEKASLLY YKGNKWRLTG
AGKNEGERIA QRHRLWELYL TKYLKTKPAY IHENAELIEH VLTPELVKEL NKLLN