Gene Aasi_0995 details

Gene Information

Locus tag: Aasi_0995
Symbol:
ID: 6377169
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Amoebophilus asiaticus 5a2
Kingdom: Bacteria
Replicon accession: NC_010830
Strand:
Start bp: 1294440
End bp: 1295678
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 38%
IMG OID: 642682117
Product: hypothetical protein
Protein accession: YP_001958078
Protein GI: 189502361
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01976] cysteine desulfurase family protein, VC1184 subfamily
            [TIGR01979] cysteine desulfurases, SufS subfamily
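
The coordinates and lengths listed above are mutually consistent, which a small sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length = gene_length(1294440, 1295678)
print(length, protein_length(length))  # prints: 1239 412
```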


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.20271
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATAACTT CCACACCTCA ATCTATTATA TGGACTCCAC AAGAAATTAG GCGACAATTT 
CCTAGCCTTG AGCAAAAGGT GCATGGGAGT AAGCCATTGG TTTATCTAGA TAATGCTGCC
ACTACTCAGA AGCCGCAGAC TGTATTAGAT GCGCTTATCC AGCATTATAA TTATAGTAAT
GCCAATGTAC ATCGGGCCAT GCATGTTTTA GCAGATAGAG CTACAGAAGC TTTAGAAGAC
ACTAGAAAAA CTGTACAAGA ATTTATCAAT GCGCCAGGAG CTGAAGAAAT TATATTTACT
TCAGGTACTA CGGCTAGTAT TAATTTAGTA GCTAGTAGTT ATGGGCAAGT TTATATACAG
CCAGGAGACG AGATTATTAT TTCTCATATG GAGCATCATG CTAATATAGT CCCTTGGCAG
ATGCTATGCC AAACAAGGAA AGCTCATCTT AAAGTAATTC CTATTGATGA TAGAGGGGAG
CTGATAATGT CTTCTTTTGA ACAGTTGTTA ACTGCAAAAA CCAGACTTGT AGCTGTTGCC
TATGCTTCTA ATAACTTAGG CACTATTAAT CCCATCCAAG AAATTATAGC TAAAGCACAC
CATGCAGGAG CTTTAGTATT AATAGATGCT GCCCAAGCAG CAGCCCACTT ACTTATAGAT
GTACAGAGTT TAGATTGTGA TTTTCTGGCT TTTTCAGCAC ACAAAGCTTA CGGACCTACA
GGGGTAGGTA TTTTATATGG AAAAAGAGCA TTACTAGAAC AAATGCCCCC TTATCAAGGA
GGGGGGGAAA TGATTAAGGA AGTAACCTTA TCTAGTAGTA CTTATAACGA CATACCTTAC
AAATTTGAGG CTGGCACCCC TAACATTGCG GATATTATAG GCTTTCGAGC AGCTTTGGAC
TTTATCCGAA ACTTAGGATG GTCGATTATT AACAAACATG AAAAAGAATT AACTAGCTAT
ACACAGCATC TTTTAGGCAA AATTGATAGA ATTAGACTTA TTGGCACAGC AACAGATAAG
GTAGGAATTG TATCTTTTAC AGTAGATAAA ATGCATCATT TAGATGTAGG AATGTTGTTA
GATGCACAAG GTATTGCTGT AAGGACGGGC CATGGTTGTG CCCAGCCACT TATGCAGCGG
CTGGGAGTAG AAGGTATTGT ACGTGTATCT TTGGCTGTAT ATAATACTTT TGAAGAAATA
AATTATTTAG CACATGTAGT AGCTAAAATA GTGAAATAA
 
Protein sequence
MITSTPQSII WTPQEIRRQF PSLEQKVHGS KPLVYLDNAA TTQKPQTVLD ALIQHYNYSN 
ANVHRAMHVL ADRATEALED TRKTVQEFIN APGAEEIIFT SGTTASINLV ASSYGQVYIQ
PGDEIIISHM EHHANIVPWQ MLCQTRKAHL KVIPIDDRGE LIMSSFEQLL TAKTRLVAVA
YASNNLGTIN PIQEIIAKAH HAGALVLIDA AQAAAHLLID VQSLDCDFLA FSAHKAYGPT
GVGILYGKRA LLEQMPPYQG GGEMIKEVTL SSSTYNDIPY KFEAGTPNIA DIIGFRAALD
FIRNLGWSII NKHEKELTSY TQHLLGKIDR IRLIGTATDK VGIVSFTVDK MHHLDVGMLL
DAQGIAVRTG HGCAQPLMQR LGVEGIVRVS LAVYNTFEEI NYLAHVVAKI VK
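
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced this record: it assumes the 10-base blocks are separated only by whitespace, and it uses the standard codon assignments, which translation table 11 shares (table 11 differs in its permitted start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G);
# '*' marks a stop codon. Tables 1 and 11 share these assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
print(translate("ATGATAACTT CCACACCTCA ATCTATTATA"))  # prints: MITSTPQSII
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing its block spacing) is one way to check the two listings against each other; `gc_content` can be compared with the reported GC content in the same way.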