Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmag_0762 |
Symbol | |
ID | 8823590 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013922 |
Strand | - |
Start bp | 757732 |
End bp | 759489 |
Gene Length | 1758 bp |
Protein Length | 585 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | |
Product | tail sheath protein |
Protein accession | YP_003478909 |
Protein GI | 289580443 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00265038 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAGAGT ATCAAGCACC CGGAGTTTAC GTCGAAGAAC AAAGCACCGG AAGCAAATCA GTCGAAGGTG TAAGTACTAG TACTGCAGGA TTTCTGGGAC AGACGGTTCG TGGACCGGTC GAACCACAGC TCATCACGAG CTACAACGAA TTCGAACGCA TCTATGGATC GAGTCCCAAA GAGTCGAACC TAGACGTCTC AGTCAACGGC TTCTTCAAAA ACGGAGGGAG TCGCTGCTAC GTCGCTCGAG TTACTGCAGC CGATCCCAAC GACGTCGCGA CGAGAACGTT GATCGACGAC GACAAAAACG GTGTCGTCGA ACTCGAGGCG AACGGTCCGG GCGACTGGGG GTCGAACGTC GCCGTGATTG TTCGTGACGG CCAACATCCG GACCAGTTCG ATATAATCGT CCGATACTGG TCGTGCGACC GAACGGAAGT GATGGATCCC GACGGTAATC GGCCCGAACC TTCACCGGAC GTCGAGGAGG TGTTTGACGG ACTGTCGACG GATCCGAAAT CCAGTCAGTT CCACGAAAAA CAACTGGCGA GTTCTGTGTT AGTCAACATC GAGTATCTCG ACGATGGCCG ACCGAAAAAC GGACTCGTCT GGTTGAGCCG AGATGACCAC GAAATCCGCA CTGATGGTGG TACCGTGGCA GTTGACCACG AGGACGTGCT GCACATTCCG GAAGATTTGG ACGAACTCGA CGAAGACGAA CTGGAAGCAC TCGCCGAACC CGTCGACATC GATGCAGATC CATCGTCGGA TGAATTCATC GATACACTCG AACAGATTCG AGACGGGGAG CGCGAGGTCG ACATGGAAGT CGTTACCGAA CTACCCGAGC AGGCAGAACC GGAGTCCGGA TTCGAGTCCG AATCCGAATC CGACTCCGAC GGTGAAGTGA CACTCAACGA CTACGAAGGT GTCAATAAAC CCGACCTCCG AACCGGTCTG GCAGCGTTCG AAGCGATCGA CGAGATTTCC ATCGTCTGCG CTCCGGACGA AAACGACGTG CAGGGACTAA CCGACGCCAT CGTTGCTCAC TGTGAAAACA TGGGTGACCG GTTCGCTATC CTGCAGTCTC CGCAAAATCC CGGCCCGGTG TCAGAAATGG AGACGCCAGT AGACTCCTCC TACGCGGGGT ACTACTACCC CTGGCTCTCG GTTCTCGATC CGGTTACCAA TCGTGAAAAG CTGGCTCCGC CGGGCGGCCA CATCGCGGGG ATCTATTCCC GAAGCGACGT CGACCACGGC GTACACAAGG CCCCTGCGAA CGAACCGCTA CGGGGAATTG TCGGCTTGCA ACGTGACATC ACGAAGGGAG AACAGGATGT CCTCAACCCG AAAGGCGTCA ACTGTATCCG GAGCTTTCAG GGGCGTGGCA TCCGCGTCTG GGGTGCTCGC ACCTGTTCCA GCGATCCGGA ATGGAAGTAT ATCAACGTTC GCCGCCTGTT TCTCTACATC GAGCAGTCGC TCGAAGAGGG AACGCAGTGG GCGGTGTTCG AACCGAACGA CGAAGATCTG TGGGCTCGTA TCCGCCAGTC TACCGAGAAG TTCCTCAAAA CGGTCTGGCG AGAAGGTGGT CTACAGGGAT CGACCGCTGA CGAGGCGTTT TTCGTCCGCT GTGGCGAGGA GACGATGACC CAGGACGACA TCGACAACGG TCGGTTGATC GTCGAAATCG GTATTGCACC AGTCAAACCG GCGGAGTTCG TCGTGTTCCG AATCGCACAG GACACCGAAA CCGCCTGA
|
Protein sequence | MPEYQAPGVY VEEQSTGSKS VEGVSTSTAG FLGQTVRGPV EPQLITSYNE FERIYGSSPK ESNLDVSVNG FFKNGGSRCY VARVTAADPN DVATRTLIDD DKNGVVELEA NGPGDWGSNV AVIVRDGQHP DQFDIIVRYW SCDRTEVMDP DGNRPEPSPD VEEVFDGLST DPKSSQFHEK QLASSVLVNI EYLDDGRPKN GLVWLSRDDH EIRTDGGTVA VDHEDVLHIP EDLDELDEDE LEALAEPVDI DADPSSDEFI DTLEQIRDGE REVDMEVVTE LPEQAEPESG FESESESDSD GEVTLNDYEG VNKPDLRTGL AAFEAIDEIS IVCAPDENDV QGLTDAIVAH CENMGDRFAI LQSPQNPGPV SEMETPVDSS YAGYYYPWLS VLDPVTNREK LAPPGGHIAG IYSRSDVDHG VHKAPANEPL RGIVGLQRDI TKGEQDVLNP KGVNCIRSFQ GRGIRVWGAR TCSSDPEWKY INVRRLFLYI EQSLEEGTQW AVFEPNDEDL WARIRQSTEK FLKTVWREGG LQGSTADEAF FVRCGEETMT QDDIDNGRLI VEIGIAPVKP AEFVVFRIAQ DTETA
|
| |