Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aasi_0995 |
Symbol | |
ID | 6377169 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | + |
Start bp | 1294440 |
End bp | 1295678 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 642682117 |
Product | hypothetical protein |
Protein accession | YP_001958078 |
Protein GI | 189502361 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01976] cysteine desulfurase family protein, VC1184 subfamily [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.20271 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATAACTT CCACACCTCA ATCTATTATA TGGACTCCAC AAGAAATTAG GCGACAATTT CCTAGCCTTG AGCAAAAGGT GCATGGGAGT AAGCCATTGG TTTATCTAGA TAATGCTGCC ACTACTCAGA AGCCGCAGAC TGTATTAGAT GCGCTTATCC AGCATTATAA TTATAGTAAT GCCAATGTAC ATCGGGCCAT GCATGTTTTA GCAGATAGAG CTACAGAAGC TTTAGAAGAC ACTAGAAAAA CTGTACAAGA ATTTATCAAT GCGCCAGGAG CTGAAGAAAT TATATTTACT TCAGGTACTA CGGCTAGTAT TAATTTAGTA GCTAGTAGTT ATGGGCAAGT TTATATACAG CCAGGAGACG AGATTATTAT TTCTCATATG GAGCATCATG CTAATATAGT CCCTTGGCAG ATGCTATGCC AAACAAGGAA AGCTCATCTT AAAGTAATTC CTATTGATGA TAGAGGGGAG CTGATAATGT CTTCTTTTGA ACAGTTGTTA ACTGCAAAAA CCAGACTTGT AGCTGTTGCC TATGCTTCTA ATAACTTAGG CACTATTAAT CCCATCCAAG AAATTATAGC TAAAGCACAC CATGCAGGAG CTTTAGTATT AATAGATGCT GCCCAAGCAG CAGCCCACTT ACTTATAGAT GTACAGAGTT TAGATTGTGA TTTTCTGGCT TTTTCAGCAC ACAAAGCTTA CGGACCTACA GGGGTAGGTA TTTTATATGG AAAAAGAGCA TTACTAGAAC AAATGCCCCC TTATCAAGGA GGGGGGGAAA TGATTAAGGA AGTAACCTTA TCTAGTAGTA CTTATAACGA CATACCTTAC AAATTTGAGG CTGGCACCCC TAACATTGCG GATATTATAG GCTTTCGAGC AGCTTTGGAC TTTATCCGAA ACTTAGGATG GTCGATTATT AACAAACATG AAAAAGAATT AACTAGCTAT ACACAGCATC TTTTAGGCAA AATTGATAGA ATTAGACTTA TTGGCACAGC AACAGATAAG GTAGGAATTG TATCTTTTAC AGTAGATAAA ATGCATCATT TAGATGTAGG AATGTTGTTA GATGCACAAG GTATTGCTGT AAGGACGGGC CATGGTTGTG CCCAGCCACT TATGCAGCGG CTGGGAGTAG AAGGTATTGT ACGTGTATCT TTGGCTGTAT ATAATACTTT TGAAGAAATA AATTATTTAG CACATGTAGT AGCTAAAATA GTGAAATAA
|
Protein sequence | MITSTPQSII WTPQEIRRQF PSLEQKVHGS KPLVYLDNAA TTQKPQTVLD ALIQHYNYSN ANVHRAMHVL ADRATEALED TRKTVQEFIN APGAEEIIFT SGTTASINLV ASSYGQVYIQ PGDEIIISHM EHHANIVPWQ MLCQTRKAHL KVIPIDDRGE LIMSSFEQLL TAKTRLVAVA YASNNLGTIN PIQEIIAKAH HAGALVLIDA AQAAAHLLID VQSLDCDFLA FSAHKAYGPT GVGILYGKRA LLEQMPPYQG GGEMIKEVTL SSSTYNDIPY KFEAGTPNIA DIIGFRAALD FIRNLGWSII NKHEKELTSY TQHLLGKIDR IRLIGTATDK VGIVSFTVDK MHHLDVGMLL DAQGIAVRTG HGCAQPLMQR LGVEGIVRVS LAVYNTFEEI NYLAHVVAKI VK
|
| |