Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmag_3793 |
Symbol | |
ID | 8826663 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013923 |
Strand | + |
Start bp | 174025 |
End bp | 175911 |
Gene Length | 1887 bp |
Protein Length | 628 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | |
Product | transposase IS4 family protein |
Protein accession | YP_003481896 |
Protein GI | 289583486 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.209171 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGTTT GTTCCCACAC AGAATCTCCC TTTGACCACG ACAGGAATTT CACGTTCACC GTCAACGACG AGACATACAA CGGGAGTATC CACTTCTTCA CTCACTACAA CGAGAAGCCC GATTTCATCG AGGTCTTACG AAACACCTTC TTCCAAGACG AAACGTATGG CGATGCCCGC GCGAAGTGGC ACCGGAACAC CACGTCATTT TACGCGTGGG TGAAGGCACA TATGCTCAGG CTCGCATGGG ATTGCCGCGA GAGACTCCTC CACCGCTTCC TTCACTCGTT CCCCAACATT TGCCGTGATT TTGGCTTCTC TGTTGCTCAC AACCGCGACA CCAGCGGAGC GCCGAGCCAG TCCCGGCTCT GGGAGATGTG GAACAAAGAG TTCACCGACG TACAGCGTGA GTTCGTGCGT ACAGCCACCG AGGAAGCCCT CGCCTTCGCC CGCGAACAAG GCATTCCTGC TCCTGACCCA GTGTTCCGAC CCGAGGAACG GGACATCTCC TCGAAGCGGA GCGAACAGCG ACTCGTCGCA GAGAAGACCA AGGAGGTCTG GCAACAGGCC AAACCGTTCG TCACGGACAC ATTCTACCTG AAGCGAGCCG ACAATAGCGT AATCCACGAG AACGCGTTCT GGGAACAGCA CGCCTACATG GGGATGCGCG AGAATATGTA CGCACAGAGC GGTCAACACT CCTTCTTCAT CGACTCCCAG CGTGACCACA CACCGAGCGC ATCCAATCAT CGCTACCAGA TAGGGAAGCT CACCGTCGAG GAGACACGCT CGATGCTCCA CGAGACAACC CGGATGCTCA TCGCCCGCGC TCGGCACAAT TCCGAACTCG TCGGCAAACT TTGGGCCGCC ATCGACATTA CCAAGGGGAA TCCGTGGACT GGTGAAATCG AGCGCGACGA GGACGACAAC ATCACGGAGG ACTGGATTCT CGGCTACAAG GACGGCGAAG TGTACTATCA GTGGGCGACC ATCCAGATTG TGGGCTACGA CATTCCGCTC GTACTGGACG CAATACCCGT CAAACGCGGG ATGAAGCGAG CCGACATCGT GGACAGCTTG CTGGAGAATG CGCTTGACCT CGTAGACGAC ATCGAACTCG TGATGATGGA CAGAGAGTTC GATAATGATG GCGTGAAGGA CGCGTGTGAT AAACACGGGG TCTACTACCT GAATGGCGCA CGCAAACGCC AATCTGAAAG AGCGACGTGT ACACGGCTTC GCCGTGCCGG AAAGACCGTT CACATCGAAG AGGAAACAGT CCCAGATGGT CCGACTCGCA AGCGGATGTT CCTCCCCTCC AGCACGGACG ACCCTGATGC GGAGGATATG GAAGAGAGCA GCGAACCCGT AAAGGGGAGT TCGGATGTCC GCGAAGAGAT GCGTGAAGAC CTCGCTGAAC TCGGTATCGA CCTAAACGAT GACGACGACA GACGCGGTTT CGGTCCGGTC ATTGACGACC TCCGTGAGCA GGAGGCAAAT GAGCCGACTG TCGGAAGCGA CGAGGATGCG CAGACGTATG CGTTGTTCGA GACGAACCAC CCCAGCGTGA CGCTGAATGA CGACGACAGC GAGATAGAGC GCATTCACAT GGTTGAGCGG ATGGTTCGCC GGTATCGCCA CCGCTGGGGC ATTGAGAATG GTTACAAGCA AATCAAGACG TTTCGCGTCC GCACGACGAG TAAACGCCAC ACGTATCGGT TCTTCAATTT CGTGTTTGCG TGCGTGCTGT ACAACGTCTG GCGGCTCGTG GACTTGCTGG TGAAACTCGC CACCGAGGGT GAGAACACGA CGTATGCGCC GCGTGTGGAC GCGAACCAGT TCTTGACTGT GGCGAAGAAA TACTACGGTC TCGACCCACC CGACTAA
|
Protein sequence | MAVCSHTESP FDHDRNFTFT VNDETYNGSI HFFTHYNEKP DFIEVLRNTF FQDETYGDAR AKWHRNTTSF YAWVKAHMLR LAWDCRERLL HRFLHSFPNI CRDFGFSVAH NRDTSGAPSQ SRLWEMWNKE FTDVQREFVR TATEEALAFA REQGIPAPDP VFRPEERDIS SKRSEQRLVA EKTKEVWQQA KPFVTDTFYL KRADNSVIHE NAFWEQHAYM GMRENMYAQS GQHSFFIDSQ RDHTPSASNH RYQIGKLTVE ETRSMLHETT RMLIARARHN SELVGKLWAA IDITKGNPWT GEIERDEDDN ITEDWILGYK DGEVYYQWAT IQIVGYDIPL VLDAIPVKRG MKRADIVDSL LENALDLVDD IELVMMDREF DNDGVKDACD KHGVYYLNGA RKRQSERATC TRLRRAGKTV HIEEETVPDG PTRKRMFLPS STDDPDAEDM EESSEPVKGS SDVREEMRED LAELGIDLND DDDRRGFGPV IDDLREQEAN EPTVGSDEDA QTYALFETNH PSVTLNDDDS EIERIHMVER MVRRYRHRWG IENGYKQIKT FRVRTTSKRH TYRFFNFVFA CVLYNVWRLV DLLVKLATEG ENTTYAPRVD ANQFLTVAKK YYGLDPPD
|
| |