Gene Aasi_1074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_1074 
Symbol 
ID6377405 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1386461 
End bp1387957 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content38% 
IMG OID642682187 
Producthypothetical protein 
Protein accessionYP_001958148 
Protein GI189502431 
COG category[R] General function prediction only 
COG ID[COG3497] Phage tail sheath protein FI 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGAAA ATTTAAAAAC TCCCGGCGTA TATATCGTCG AGAAGGACAC CGGTGCCAAT 
GCTGTGGTTC AGGTGGCAAC TGCAGTTCCC GTTTTCATAG GATTTACCGA GCGGGCAGAA
ATCAATGGAA AATCTTTCCA TATGAAGCCG GTGCATATTA ACTCTTTATC TGAGTTTGAA
ATATTCTATG GAAAAGCACC TGTGCCTGTC TTTACCGTTA AACCTGCAGA AAAAGGAGGT
GGAGATCTTA ATATGAATGG ACAAATGTAT ACCTTACAAC AAAGTCCTTA TTCAAAATTT
TACCTATACA ATAGTTTAAA ATTATTTTTT GATAATGGTG GTGCAGATTG CTACATCATA
TCTATTGGAC AATATGGTAA AGATCCACAA CTGCTAGCAA TTACCCCTGA TACATTCAAA
AAAGCAATAG ATACCTTAGC AGGCGAAGAA GTACCTACTA TGTTGCTTAT GCCCGACTCT
CTGCTACTAG ATGAAGAAGA TTCTTCTTAT TATTCTGTAC AAACATATGC TTTGCAACAT
TGTGGCAAAT ATATGGATAA AGTAGCGCTA TTTGATATCT GGGGAAGTGG AGAAGAGCTT
CCATTAGGAG AAGACAAAAA TAAATATGTA ACTCGATTTA GAGAAAATAT AGGCTTAGAC
AACCTAACCT ATGGTGCAGC GTACTACCCT TGGGTTAAAA CCAATATCAT ATCAATCAAC
GATATTGGAT ATGAGAACTT TAATTTAGAT TCTTTAGAAT CTCTTATTAA TGAAGCACAC
AAACCTATCC TGCACAATAT CAAAACTGCT ACTAGCGAAA AGGAGAAAAA ATATTGGGAT
GCAGGACTTA AAAATGCTAG TAAAGAATAT AAGCTTCTAC GTAAAACTAT AGCAGACAGA
CTTAATGTAT TGCCAGCAGC ACCAGCTATG GCAGGGTTGT ACACACGTAC TGATAGAAGT
AGAGGCGTAT GGATAGCACC AGCTAACCAA AACCTAAATT CTGTTATTGA GCCTGCTATT
AAGATTACGC ATGAAGATCA AGAAACTCTT AACGTAGATG CTATAAGCGG AAAATCTATT
AATGCTATCC GTGCATTTAG AGGAAGAGGA TCTGCTATTG TTTGGGGGGC AAGAACGTTG
GCAGGCAACA ATGTAGAATG GCGCTATATT AACGTAAGGA GATTATTTAT ACTTATTGAA
CAGTCTATCA AACAAGCATC CTTCTCTGTT GTATTCCAAC CTAACGTATC CATAACCTGG
GCTATAGTAA AAGGAAGTAT TGGTAACTTC TTAACCAACT TGTGGAGACA AGGTGCTTTA
GTAGGAAACA CTCCTTCTGA AGCCTTTACA GTAAGCTGTG GACTTGGTGA AACTATGACT
CAAGAAGACA TTAATGAAGG TATCATGCGA ATAAAAGTTC AGGCAGCAGC TTCTAGACCA
GCAGAGTTTA TCGTCATTAC ATTTGAGCAA AAGATGGGCG GACAAGAAGG AAGTTAA
 
Protein sequence
MPENLKTPGV YIVEKDTGAN AVVQVATAVP VFIGFTERAE INGKSFHMKP VHINSLSEFE 
IFYGKAPVPV FTVKPAEKGG GDLNMNGQMY TLQQSPYSKF YLYNSLKLFF DNGGADCYII
SIGQYGKDPQ LLAITPDTFK KAIDTLAGEE VPTMLLMPDS LLLDEEDSSY YSVQTYALQH
CGKYMDKVAL FDIWGSGEEL PLGEDKNKYV TRFRENIGLD NLTYGAAYYP WVKTNIISIN
DIGYENFNLD SLESLINEAH KPILHNIKTA TSEKEKKYWD AGLKNASKEY KLLRKTIADR
LNVLPAAPAM AGLYTRTDRS RGVWIAPANQ NLNSVIEPAI KITHEDQETL NVDAISGKSI
NAIRAFRGRG SAIVWGARTL AGNNVEWRYI NVRRLFILIE QSIKQASFSV VFQPNVSITW
AIVKGSIGNF LTNLWRQGAL VGNTPSEAFT VSCGLGETMT QEDINEGIMR IKVQAAASRP
AEFIVITFEQ KMGGQEGS