Gene Aasi_0974 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_0974 
Symbol 
ID6377117 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1272089 
End bp1273198 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content40% 
IMG OID642682098 
Producthypothetical protein 
Protein accessionYP_001958059 
Protein GI189502342 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2971] Predicted N-acetylglucosamine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.131863 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTAAGT ACCTCAAAAT AAAAATAGAC ATGGTAAACA GGATGTTTCT AGTAGCAGTT 
TCCCTTTTTT CTATTCTTGG CCAACAGGTT ATTGCCGATG ACCTAATAGT AGTACCTCAT
GCTGACTACA TTTTATGTAT TGATGGACGT GGTTCTAAAA CTTCTTTACA AGTTGTTACT
ACCCAAGGAG CAGTTATTCC TTTACAAGGA CCCGCGGGCA TAGTACAAGA AATTTATACC
GAAGGAAGTA ACGTTGCAAG TTTGGGTTGG GATTTAGTAC AGAAACGACT AGAAAAACTT
TTAAACCAAG TTAAGTTCCC TCCAGGTAAC AATCCTTTAC AGAATAAAAG CTCTGTTGCA
GTTGTTGCAG GTTTTGCAGG CATTGGGCTC CCAGAAATAC GCCAAAAATT TATCGATTTA
TTTCAACAAT GGGGTTTGAA CCCAGATAAA ATTGTTGTAA CCACAGATAT TAACTTAGCT
AAAGAGCTTT TAAGCCAAAA AGATGGGGCT GTATTGATTG CCGGATTAGG CTCTGTTGCT
TTTGTCAAGC ATCAGGGACA TTGCTTGCGC TTTGGGGGGC TTGGATGGTA CTTAGGAGAT
GAAGGGAGTG GTTTCTCTGT AGGAAAAAAG GCTATAGCTG CAGCTATAGC TGAAGATAAG
GGTTTTGGTA TGAAAACAGC TTTGACTCCT ATTTTAAAAG AAATGTTTCA AAAACAAGAA
CTATATCGTC TAATTCCACT TTTGCAGGAT GGCACCATCA GTTCTGAACA GGTAGCAGCG
ATTGCTCCAA TAGTTTTTGA ATGCGCTTAT AGTAAAAAAG ACCCGGTAGC ACACCTTATT
GTCAAGCTGG CAGCACAGGA GTTAGCTAGT TTAATTCGCC AAGGGATAGA GATGATTCAG
AAAGAGTTAA AGCCTTTACC TGCCAACTGG CCGATCTACT TGATTGGTGG CCAATTTAAA
GGTCCCTATG CACAGGCTTG GACCCAAGAA CTCTGGTCAT TTTTACCACA AAGGGGAAAA
ATGGTTCCAC ACAATCTAGC TAAGTCTAAT ACTACTACGG TGGTAGTGCA GCAAAAGTTA
GCTGCAAGAC GAAACAAAAG GGGTTGGTAA
 
Protein sequence
MFKYLKIKID MVNRMFLVAV SLFSILGQQV IADDLIVVPH ADYILCIDGR GSKTSLQVVT 
TQGAVIPLQG PAGIVQEIYT EGSNVASLGW DLVQKRLEKL LNQVKFPPGN NPLQNKSSVA
VVAGFAGIGL PEIRQKFIDL FQQWGLNPDK IVVTTDINLA KELLSQKDGA VLIAGLGSVA
FVKHQGHCLR FGGLGWYLGD EGSGFSVGKK AIAAAIAEDK GFGMKTALTP ILKEMFQKQE
LYRLIPLLQD GTISSEQVAA IAPIVFECAY SKKDPVAHLI VKLAAQELAS LIRQGIEMIQ
KELKPLPANW PIYLIGGQFK GPYAQAWTQE LWSFLPQRGK MVPHNLAKSN TTTVVVQQKL
AARRNKRGW