Gene Aasi_1230 details


Gene Information

Locus tag: Aasi_1230
Symbol:
ID: 6377260
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Amoebophilus asiaticus 5a2
Kingdom: Bacteria
Replicon accession: NC_010830
Strand:
Start bp: 1573759
End bp: 1574883
Gene Length: 1125 bp
Protein Length: 374 aa
Translation table: 11
GC content: 35%
IMG OID: 642682326
Product: hypothetical protein
Protein accession: YP_001958284
Protein GI: 189502567
COG category: [L] Replication, recombination and repair
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake
TIGRFAM ID: [TIGR00732] DNA protecting protein DprA
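
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 1573759
END_BP = 1574883


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1125 374
```

Both results agree with the Gene Length and Protein Length fields of the record.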


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.0232015
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATGAAC AAGAAAAAAT TTATCAATTA GCTTTGAGCC TTATTAAAGG TATAGGATTT 
AATACATGGA AGAGGATAAT TGAAAAATTC AAAACAGCTC AGGCAATTTT CCAGGCATCA
AAGACATCCC TCACAGGTAA TCTGCCAGGT ATTCCCTTAT CTATTATACA AGCTATTTTA
GCTAAGGATA CGCTATCGAT AGCAGAGAAA TTAGTGGGTG CACATCAAAA GAATGGTATA
CAGGTTCTCT CCTTTTTTGA TGAATCTTAC CCCTTGCGCC TTAAACATAT TGCTACTCCT
TCCAGTTTTT TATTTTGCCA GGGTAATATG AATTTTAGTA TGTCTAAGGT TATAAGTATT
GTAGGCACTA GAAAGGCTAC CCCTTATGGG AAAAGCTTTG TGGAAAAGTT TATAGCAGGT
TTAAGGGAAT ATGAAGAGAT TCTAGTTGTT AGTGGATTGG CATATGGTAT TGATCTTCAA
GCACATAAAA TGTGTTTACG TTATGGATTA TCTACAGTAG GTGTATTGGC AGGTGGGTTA
GACAAGATTT ATCCGACAGC CCATAAAAAG GTAGCTCTAG ATATGTTAGC CGATGGTGGC
TTGGTAAGTG AAATTCCTAT TGGTAGCACA TTAGAAACTT TCCAATTTCC CCAAAGAAAC
AGAATTATTG CAGGATTGGC AGATGCTACT GTTGTGGTAG AGGCAGACTA TAAAAGTGGT
GCTATTATTA CAGCTAATTT TGCTAATGCT TATAACAGAG AGGTATTTGC TGTGCCAGGT
AATATTGATG CTACTTATTC AGCTGGCTGT AACCATTTAA TTAAAACTCA ACAAGCGCAT
CTATTAACAA GTACTGATGA TCTAGCTTAT ATAATGAATT GGCAAAAGTG CTCTCAACCA
AATAATCTTA TAAATTCACA TAAAGAAAAA TTGGTAGGCC TTAGTCAAGT AGAGCAAGAG
ATAGTACAAG TATTAAAATT GTTACAAAAA GAAGCTTATA TTGATGAGAT AAGTCAACAA
ATTCAATTAT CTCCTAGCCA GGTTTCATCT ATGCTTTTGC AACTAGAATT AAAAAATATT
GTCGAATGTC TGCCAGGTAA TAAGTTTAAG CTAGTGAAAT TTTAA
 
Protein sequence
MHEQEKIYQL ALSLIKGIGF NTWKRIIEKF KTAQAIFQAS KTSLTGNLPG IPLSIIQAIL 
AKDTLSIAEK LVGAHQKNGI QVLSFFDESY PLRLKHIATP SSFLFCQGNM NFSMSKVISI
VGTRKATPYG KSFVEKFIAG LREYEEILVV SGLAYGIDLQ AHKMCLRYGL STVGVLAGGL
DKIYPTAHKK VALDMLADGG LVSEIPIGST LETFQFPQRN RIIAGLADAT VVVEADYKSG
AIITANFANA YNREVFAVPG NIDATYSAGC NHLIKTQQAH LLTSTDDLAY IMNWQKCSQP
NNLINSHKEK LVGLSQVEQE IVQVLKLLQK EAYIDEISQQ IQLSPSQVSS MLLQLELKNI
VECLPGNKFK LVKF
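
The sequences above are laid out in space-separated blocks of 10 characters, 60 per line, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup step together with a GC fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first line of the gene sequence, not the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGCATGAAC AAGAAAAAAT TTATCAATTA GCTTTGAGCC TTATTAAAGG TATAGGATTT"

seq = clean_sequence(first_line)
print(len(seq))  # prints: 60
print(f"GC fraction of first line: {gc_fraction(seq):.2f}")
```

Applying the same two helpers to all lines of the gene sequence should give a value that can be compared against the GC content field of the record.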