Gene Aasi_0590 details

Gene Information

Locus tag: Aasi_0590
Symbol:
ID: 6376392
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Amoebophilus asiaticus 5a2
Kingdom: Bacteria
Replicon accession: NC_010830
Strand:
Start bp: 759643
End bp: 760701
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 40%
IMG OID: 642681746
Product: hypothetical protein
Protein accession: YP_001957720
Protein GI: 189502003
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1363] Cellulase M and related proteins
TIGRFAM ID:
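
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon (the record does not state either convention). The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(759643, 760701)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values agree with the listed gene length of 1059 bp and protein length of 352 aa.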


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAACTTA ATCTACCATT ATTTAAAAAA GTCAGTGAAA CACCCGGTGC TCCCGGATTT 
GAGCAGCGTA TTCGCCAACT AATTATAGAG GAAATAAGAA CTTTTGTAGA CCATGTAGAG
GTTGACCATA TGGGTAATCT TATTGCTGTA AAATATGGTG TTCAGCAACC AAGCCATGAA
AAGAAAGTAA TGGTAGCAGC ACATATGGAT GAGTTGGGGC TGATAGTTAA ATACATAGAC
CAAGAAGGTT TTATTAGGTT TCATACATTA GGTGGATTTG ATCCTAGAAG TCTTATTGGT
CAGCGGGTGA TCATACATGG TAAGCAAGAT TTGGTAGGCG TCATCGGCAT AAAAGCCATA
CATTTTATGA CAGAAGAAGA AAGGAAACGC CCACTCGAAA TTAGTGATTT ATACATTGAC
ATAGGTAGAA CGCAACAACA GGCTGCGACC TATATATCTA TAGGTGATTC TATTACACGC
GAAAGAAGCT TGATAGAGTT AGGTAATTGT ATTACTGGCA AGTCGCTAGA TAATCGAACA
GGTGTATTTG TGCTTATAGA AGCACTACGT ACCCTACAAG AAGTACCTTA TGATGTTTAT
GCTGTTTTTA CAGTGCAAGA GGAAGTGGGC CTACGAGGTG CACAAGTGGC TGCACATCAT
ATTGAGCCTT ATTTTAGCTT GGCATTAGAT ACCAGTACTT CATTAGATGT GCCTAATGTG
CAACCCCATG ATAGGGTCGC TAGATTAGGT GATGGGGCAG GAATTAAAAT TATGGATGGC
CATACCATAT GTGACTGTCG AATGGTAGAC TATCTGAAGA CAATAGCAAC TCAACATAAT
ATTGCTTGGC AAACAGATAT TAAAGCAGTA GGAGGAACTG ATACTGCTCC CTTGCAACGT
ATGCCTAAAA AAGGATCTAT TGCAGGAGCC TTAAGTATTC CTATACGCTA TGCTCACCAA
GTGGTAGAAG TAGTACATCA AGCCGATGCA ATATCTGCTA TACAGCTTTT ACAACAAGCT
TTAGTTGGCT TAGATACTTA TAGTTGGGAG CAAGTTTAG
 
Protein sequence
MQLNLPLFKK VSETPGAPGF EQRIRQLIIE EIRTFVDHVE VDHMGNLIAV KYGVQQPSHE 
KKVMVAAHMD ELGLIVKYID QEGFIRFHTL GGFDPRSLIG QRVIIHGKQD LVGVIGIKAI
HFMTEEERKR PLEISDLYID IGRTQQQAAT YISIGDSITR ERSLIELGNC ITGKSLDNRT
GVFVLIEALR TLQEVPYDVY AVFTVQEEVG LRGAQVAAHH IEPYFSLALD TSTSLDVPNV
QPHDRVARLG DGAGIKIMDG HTICDCRMVD YLKTIATQHN IAWQTDIKAV GGTDTAPLQR
MPKKGSIAGA LSIPIRYAHQ VVEVVHQADA ISAIQLLQQA LVGLDTYSWE QV
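
The sequences above are printed in space-separated blocks of ten, so they need cleaning before use. The sketch below shows one way to strip the layout, compute GC content, and translate the gene sequence. It assumes the amino acid assignments of translation table 11 match the standard genetic code (the two tables differ in their permitted start codons), and it does not handle alternative start codons such as GTG or TTG, which is sufficient here because this gene begins with ATG. All names are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code layout).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = clean_sequence("ATGCAACTTA ATCTACCATT ATTTAAAAAA")
print(translate(first_blocks))
```

Translating the first three blocks yields MQLNLPLFKK, which matches the first block of the listed protein sequence. Passing the full gene sequence through `clean_sequence` and `gc_percent` is one way to check the listed GC content.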