Gene Aasi_1039 details


Gene Information

Locus tag: Aasi_1039
Symbol:
ID: 6376896
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Amoebophilus asiaticus 5a2
Kingdom: Bacteria
Replicon accession: NC_010830
Strand:
Start bp: 1345015
End bp: 1346358
Gene Length: 1344 bp
Protein Length: 447 aa
Translation table: 11
GC content: 35%
IMG OID: 642682155
Product: hypothetical protein
Protein accession: YP_001958116
Protein GI: 189502399
COG category:
COG ID:
TIGRFAM ID:
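
The start, end, gene length, and protein length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that the coordinates are 1-based and inclusive on both ends, and that the gene length counts the stop codon while the protein length does not. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1345015
end_bp = 1346358
reported_gene_length = 1344      # bp
reported_protein_length = 447    # aa

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print("gene length matches:   ", gene_length == reported_gene_length)
print("whole number of codons:", remainder == 0)
print("protein length matches:", protein_length == reported_protein_length)
```

Under those assumptions the reported values agree with one another: the span covers 1344 bp, which is 448 codons, one of which is the stop codon.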


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAGTGT ACCAAATGAA AAATAGCTGG TGGCAAAAAA CCCTATATTA TTTGTACCAC 
CTTCCACAAA ATATCTACAG TTATTCTAAA TATAAGGTAG AAAAATATGG CGCAGAATAT
ACGCCATTTG CCTTTTTTAT GGCGTTTAAT TATATGATTC CTCTTTTTAT GGGAATTCAG
CCAGGAAATA ATTCAATTTA TGTATGGTTG TTGATTGTCA AAACCATTGG AGTTTCTCTT
TGCATAGGTC TTCTTTTAAG GTCGTATTGG GCTGAGTGGT TCCAAAAATA CTTCCCTATT
TACTGGCATT TTACTCTATT TTACTGCATT CCTTTTTCTT TTACTCTGCT ATTTTTAATG
AATGGTAGCA ATATTCAGTC GCTTCTAAAC GTGGCATTAG CTAGCATGTT ACTGATCTTA
TTAGTTGACT GGACCACATT TTTGTCGCTT TCTTTAGCGG GAAGCATTAT AGGTATATGG
TTTTATGCAC AATTTATAGA CAAGATACCT GTATTAGATC TTTATGATCT ATATATAATT
TTTCAGGTGT GTGTTTTTTC CATGGTAATT GGGCTCATCT TTGGACGTCG ACGTGAAATC
TACATGGACT TTATGTCTAA GGGAAAATAT CAACTCCGCA AAACTCAACA AGAAGCTTTC
TGGGAAGTTG CTGCTCTTAT GCAGTACCGA GAAACACTAC TGCAAGAATT AAATCCGGAT
AAGGGCGCTA TTTCTAATGA TATAATAGCC ACCTATATGC AGCAAGCTAT TTGTCGAATG
GCTAACCATA TGCAGTTAGA TATTACCTTG GTGAATTTAA AGGAACTTTT AAAAGAAGTG
GGTGTATCTT GTAACAGCAC AGATTTAAAA CAATCACAAT TGTACTTTAA GAAAGAAACA
GAAAAAGATG TATTAAGAAC AGATGTTAGT AAGCTCAAGC AACTCCTTTT TAATGCGATC
AACTATATTG GCAGAGACCA TAGTTCCAAT CCTATTAATG TCGTAATAAA AGACTCTGAG
ATAAAATATA ATGTAGCTAA TATCGAAACT CCTGGCTCTA GTAAGTTACC TGCCTTGGAG
ATAGTGATTA CTAAAGAGTT CCTGCAGCCC TCTAAATCTC AGCTACCCCT TTACATGATT
GACCTCAACC TATCTAGTGG TTGGTTAGAC GAGCAAGAAA ATGATCCTTT GCTTATAGAA
AATGCCTATA TTATTGATAC TCAATATGGG TATGTAGCAA AACAGTCATC TCAAACACTA
GTGTATATCT TGCCAGTCTC CCTTCCGAAA ATGCAATATA TATCTACACA ATTTTTTAAC
AAGTATACAG AAGCAAAGAT TTAA
 
Protein sequence
MQVYQMKNSW WQKTLYYLYH LPQNIYSYSK YKVEKYGAEY TPFAFFMAFN YMIPLFMGIQ 
PGNNSIYVWL LIVKTIGVSL CIGLLLRSYW AEWFQKYFPI YWHFTLFYCI PFSFTLLFLM
NGSNIQSLLN VALASMLLIL LVDWTTFLSL SLAGSIIGIW FYAQFIDKIP VLDLYDLYII
FQVCVFSMVI GLIFGRRREI YMDFMSKGKY QLRKTQQEAF WEVAALMQYR ETLLQELNPD
KGAISNDIIA TYMQQAICRM ANHMQLDITL VNLKELLKEV GVSCNSTDLK QSQLYFKKET
EKDVLRTDVS KLKQLLFNAI NYIGRDHSSN PINVVIKDSE IKYNVANIET PGSSKLPALE
IVITKEFLQP SKSQLPLYMI DLNLSSGWLD EQENDPLLIE NAYIIDTQYG YVAKQSSQTL
VYILPVSLPK MQYISTQFFN KYTEAKI
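
The sequences above are printed in blocks of ten characters, so the spaces and line breaks need to be stripped before the text can be compared against the reported gene length and GC content. The following is a minimal sketch of that cleanup, not the method the database itself uses. The helper names `normalize` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def normalize(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already-normalized nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used only as a short illustration.
first_line = "ATGCAAGTGT ACCAAATGAA AAATAGCTGG TGGCAAAAAA CCCTATATTA TTTGTACCAC "
seq = normalize(first_line)

print(len(seq))                    # number of bases after cleanup
print(seq[:3])                     # leading codon
print(round(gc_fraction(seq), 2))  # GC fraction of this line only
```

Passing the full gene sequence through `normalize` should give a string whose length can be compared with the Gene Length field, and `gc_fraction` on that string can be compared with the GC content field. The value for a single line is not expected to match the whole-gene figure in general.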