Gene Aasi_0861 details


Gene Information

Locus tag: Aasi_0861
Symbol:
ID: 6377122
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Amoebophilus asiaticus 5a2
Kingdom: Bacteria
Replicon accession: NC_010830
Strand:
Start bp: 1096513
End bp: 1098018
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 37%
IMG OID: 642681998
Product: hypothetical protein
Protein accession: YP_001957959
Protein GI: 189502242
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2317] Zn-dependent carboxypeptidase
TIGRFAM ID:
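
The coordinate and length fields above are consistent with each other. A minimal sketch of the arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length (the variable names are illustrative, not part of the record):

```python
# Values copied from the record above.
start_bp = 1096513
end_bp = 1098018

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bases; the last codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values match the reported Gene Length and Protein Length fields.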


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.157746
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGATA GAACAAACAA TTACTTAAAA CTTGAAAATA GGTTAAAACA AATTACCCAC 
TTAAAGAATA TTGCTTCACT TGCTCATTGG GATGCAGAAA TTAATCTACC TACAGCTTCT
ACAGCCAACC GACATCAAGA ACTAGCGACG CTAGCAGAAA TTATTCACCA AATGTCGGTT
GCCAAAGAAT TAGGTGATTT AATTGAAGCC GCAACTCAAG AAGTGAGCGA GCTTAATGAG
TGGCAAAAGG CTAACCTGGC ACTCATTAGA AGAACTTATG AGCATGCACA ATGCATTAGC
CCTAAGTTAC AACACTCATA TACTATGGCT ATTAGTGAGT GTGAATATAT TTGGCGAGAT
GCACGCAAAA ACAGTAATTT CAAACAGCTG GTTCCACACT TAAACCAAGT TTTTGAGATT
TCCCGCACTA TAGCAGATTG TAAAGCTAAG CACTTCCAAA AAGATCCTTA TGACATGCTC
ATGGATACTT ATGAAGCAGA TAGCAGCGCA AAAGAAATTC AGGAAGTATT TGATGTACTT
AAGCGCGAAT TACCTAAGCT TATTGAGAAG ATTACAGCTA AACAACAAAA TGAAAAAATC
ATCCCACTTT CTGAGAAAAT AGATATAAAT ACACAAAAAG CTATTGGCTT GCACATTATG
GAAAGGATGG GATTTGATAT GGATAAAGGA CGTTTAGATA TTTCTGCACA TCCTTTTTGT
AGTGGCTCCA ATGATGACGT AAGGCTTACC ACTCGCTATA ATGAAAATAA TTTTATAACA
GGTTTATTTG GTATTATACA TGAGGCAGGG CATGGTTTAT ATCAGCAGAA TCTTCCAGAA
GCATATAGAA ACCAGCCAGT TGGCCATTAT AAAGGTATGG CTTTTCATGA AAGCCAATCT
TTAATTATGG AATGCCAAGC AGGCACCTCT TTAGAATTTA TACAGTACTT AGCAAAGCTT
TTACATGATA ATTTTGGGTT AAAAAGCCCT GCCTATTCTG CAGAAAACTT ATATAAACTA
GTAACCAGGG TCCAGCCTAG CTTTATTCGT GTAGATGCCG ATGAGGCCAC TTATCCTTTA
CATGTCATAC TGCGATTTGA AATTGAACAA GCCATCATTA AAGATAGAGT GCAAGCAGAG
GATCTGCCAA ACTTATGGAA CACTAAAATG CAGGAGTACT TGGGTATTGT TCCTGCTAAC
GATAGAGAAG GATGTATGCA AGATGTACAC TGGTCAGCTG GCTTATTAGG TTACTTTTCT
TGCTATACTA ATGGGGCGAT TATTGCTAGT ATGCTCATGA AGGCTGCACA AGAAAAGTAC
CCTGCTATTA AAAGCCAATT AAGCGAGGGT AATTTCCAGA ATTTAAATAA CTATCTCAAT
CAGCACTTAA GAAATTTAGG TTCTCTAAAA GGTTCTACTG AATTACTTAA AACTGCAACA
GGATTTGAAA AAATCAATCC TAATATTTTC TTAGAATACT TAACCAATAA GTATTTGGCA
TCATAG
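
The gene sequence is laid out in space-separated blocks of 10 bases, so it needs cleaning before any calculation. The sketch below shows one way to strip the layout and compute a GC percentage that could be compared with the GC content field of the record. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the short demo string is only the first two blocks of the sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# Demo on the first two 10-base blocks of the gene sequence only.
demo = clean_sequence("ATGTCAGATA GAACAAACAA")
print(len(demo), round(gc_percent(demo), 1))
```

To check the whole gene, paste all sequence lines into one string, pass it through `clean_sequence`, and compare the length with the Gene Length field before computing the GC percentage.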
 
Protein sequence
MSDRTNNYLK LENRLKQITH LKNIASLAHW DAEINLPTAS TANRHQELAT LAEIIHQMSV 
AKELGDLIEA ATQEVSELNE WQKANLALIR RTYEHAQCIS PKLQHSYTMA ISECEYIWRD
ARKNSNFKQL VPHLNQVFEI SRTIADCKAK HFQKDPYDML MDTYEADSSA KEIQEVFDVL
KRELPKLIEK ITAKQQNEKI IPLSEKIDIN TQKAIGLHIM ERMGFDMDKG RLDISAHPFC
SGSNDDVRLT TRYNENNFIT GLFGIIHEAG HGLYQQNLPE AYRNQPVGHY KGMAFHESQS
LIMECQAGTS LEFIQYLAKL LHDNFGLKSP AYSAENLYKL VTRVQPSFIR VDADEATYPL
HVILRFEIEQ AIIKDRVQAE DLPNLWNTKM QEYLGIVPAN DREGCMQDVH WSAGLLGYFS
CYTNGAIIAS MLMKAAQEKY PAIKSQLSEG NFQNLNNYLN QHLRNLGSLK GSTELLKTAT
GFEKINPNIF LEYLTNKYLA S
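
The protein sequence can be checked against the gene sequence by translating codons. A minimal sketch follows, assuming the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons). The function name `translate` is hypothetical, and only the first and last blocks of the gene are translated here.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base
# varies fastest). Table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base blocks of the gene sequence.
print(translate("ATGTCAGATA GAACAAACAA TTACTTAAAA"))  # MSDRTNNYLK
# Last six bases of the gene sequence: final residue plus the stop codon.
print(translate("TCATAG"))  # S*
```

The first output matches the opening block of the protein sequence, and the second shows the final serine followed by the stop codon.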