Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1416 |
Symbol | |
ID | 5054460 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1275970 |
End bp | 1277088 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640468957 |
Product | hypothetical protein |
Protein accession | YP_001153626 |
Protein GI | 145591624 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.189691 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACACAA ACAAAATAAT ACTAATAGTA ACACTGGCGG CGGCACTGGC ATTTGCCCAG ACCGTGACGG TCAGTGCCGG GGCGAACGCG ACTCTGAGCG ACCCGGCTAA GTTCCACCTC TGGCTCAAGT GCAGGGCCGT GGAGCTCTAT ATCAACGCCA CGGACGTCGG GCTGAGGGGC AACGTGACGT TGCCGCCGTG CGAAGAGCTG GTTCAGAACT TCACGTCTAA TAGACTAGTC TTCATCATAG GGAGGGCCGG GGTTAGGCCG CTCCCCATGA TAGGAGTGGG GGAGCTAAAA GCGCTCAACG CGTCCGACCC CCACGGAGTT TTCACACAAC TCAGGGAAAT TAGGAGACAA GCCCTCCTCA ACCTCCCCAA GCAGATCAAC AAGACGGTTG ACACGGCGTA TAGAGAGATT GAAACCGGAA ACGGAACGGA GCCGCTAGAA AGAGCAGTAC TAACACTAAC CAGAGTTAGA AGCTTACTAG AGAAAGTAAA CGCTTCCCAG CGCGCGGTTG ATGTACTGAG CAGAAATATA GAATTTCTAA ACGACACGCG CCAGTTCATC GCCGAGGCGA GGACGGCGCC TTACGACAAA CTAGCAGCGC TGTTGGAGAG AATTCAGAAC CACACGGACC TGCCTCATGC CAAGAAGGTT ATGGAGAGGT ACATGGTCCA GACTAGGGCT ACCATAGAGG AGAGAGCGTG GAAGGAACTT GAGGAGATCT ACTCAAATCT CAACGCGACT TCCGAGACGG AGATGCTCGC CGTGCTCAAT AAATCCGTGG CAACGCTTGA GAAAGTGACG AAACTGCTTG AGAGGGTAAA TGCATCTAAA ACTGCAGTGG AGGCGGTGAG GAGAAACATA TTGTTGTTCA ACGAGACGAA GCAGGCGGTG GAGTCTTTGC TCAGAGGCGA CTACGAAAAA GCCAAGGCCC AGTTTGAAAA ATTAAGGCAA ATTCTCGGCC AGCTCCCCGA GTGGGCTAAA AAAGCGCTGG AGGAAAAGAT AAACAAAATC AGTAAAGCGG TGCAAGAACG CGGAAGCGGC ACCCCAGCGC CGCCCATAGG CAATTCCCGT CCGCAAAGGG GCAACGAGGG GAAGGCGCCG GGCAAATAG
|
Protein sequence | MNTNKIILIV TLAAALAFAQ TVTVSAGANA TLSDPAKFHL WLKCRAVELY INATDVGLRG NVTLPPCEEL VQNFTSNRLV FIIGRAGVRP LPMIGVGELK ALNASDPHGV FTQLREIRRQ ALLNLPKQIN KTVDTAYREI ETGNGTEPLE RAVLTLTRVR SLLEKVNASQ RAVDVLSRNI EFLNDTRQFI AEARTAPYDK LAALLERIQN HTDLPHAKKV MERYMVQTRA TIEERAWKEL EEIYSNLNAT SETEMLAVLN KSVATLEKVT KLLERVNASK TAVEAVRRNI LLFNETKQAV ESLLRGDYEK AKAQFEKLRQ ILGQLPEWAK KALEEKINKI SKAVQERGSG TPAPPIGNSR PQRGNEGKAP GK
|
| |