Gene Aasi_1099 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_1099 
Symbol 
ID6376495 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1413076 
End bp1414086 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content35% 
IMG OID642682211 
Producthypothetical protein 
Protein accessionYP_001958171 
Protein GI189502454 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.299802 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAAAG CAGTAGTTGT CATACTTAAT CATAATGGCA AAGCTCTTTT ACAAAAGTTT 
CTACCCAGTG TAATACAACA TAGCCATCCT TATAGGGTAG TAATAGTAGA TAATGCATCG
GTAGATGATT CTATTAATTT TTTATCTACT AATTTTCCTC ATATACAGTG TATACGCCAT
ACTAAGAACG AAGGTTTTGC TGGTGGGTAT AATTTGGCTT TACAAGAAAT TAAAGCTAAA
TACTATATAT TAATGAATGC AGATGTTAAA GTTACTAGTA ATTGGATAGA ACCTGTTTTA
GAGTTAATGG AAGGAAATGA GCAGGTATCT GCTTGCCAAC CTAAAATATT ATCATACCAT
AAGCATGAGA AATTTGAATA TGCGGGTGCA ACAGGGGGAT TTATAGATTT GTTAGGTTAT
CCTTTTTGTC GGGGGCGTTT ATTTACTAGT ATAGAGAAAG ATCTAGGTCA GTATAATGAT
ACGCGTGCAG TGTTTTGGGC TAGTGGCGCT TGCATGTTTC TACGAGCTAG CGTCTTTGGG
GAGCTAGGTG GGTTTGATAA ACTTTTATTT GCCTACTATG AAGAAATTGA TCTTTGCTGG
CGTATGCAAC AGTATGGGTA TAAGATTTAT TATTGTGGCA ATAGTAAGGT ATTCCATGTT
GGAAGTGCAA CTATTGGTAT AGATAACCCA TATAAAACTT ATCTGAAATT TAGAAATCGA
GCGCTTGTTC TTTATAAAAA CACACCAAGC CATTTTTTAA GCTGGAAACA CATTTTGCGT
ATCATATTAG ATTTGTTAGC AGCTTTGCAA GCTGTTTTGC AAGGGCGAGC TAAACACAGT
TGGGCTATTT TACAAGCACA GATCGATTTC TTTAAACTAA AAAAGAATTA TAAACCAACT
TTAAATACAC AGCAGGTCAA GCAAGTGTAC CATGGCATCC TTCCTTTTGT TTACTTTATA
CAAGGAAAAA AAAAGTTTTC TGATTTAAAC CAAGCTAAGT TTAGCAAATA G
 
Protein sequence
MEKAVVVILN HNGKALLQKF LPSVIQHSHP YRVVIVDNAS VDDSINFLST NFPHIQCIRH 
TKNEGFAGGY NLALQEIKAK YYILMNADVK VTSNWIEPVL ELMEGNEQVS ACQPKILSYH
KHEKFEYAGA TGGFIDLLGY PFCRGRLFTS IEKDLGQYND TRAVFWASGA CMFLRASVFG
ELGGFDKLLF AYYEEIDLCW RMQQYGYKIY YCGNSKVFHV GSATIGIDNP YKTYLKFRNR
ALVLYKNTPS HFLSWKHILR IILDLLAALQ AVLQGRAKHS WAILQAQIDF FKLKKNYKPT
LNTQQVKQVY HGILPFVYFI QGKKKFSDLN QAKFSK