Gene Nmag_3793 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_3793 
Symbol 
ID8826663 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013923 
Strand
Start bp174025 
End bp175911 
Gene Length1887 bp 
Protein Length628 aa 
Translation table11 
GC content57% 
IMG OID 
Producttransposase IS4 family protein 
Protein accessionYP_003481896 
Protein GI289583486 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.209171 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGGTTT GTTCCCACAC AGAATCTCCC TTTGACCACG ACAGGAATTT CACGTTCACC 
GTCAACGACG AGACATACAA CGGGAGTATC CACTTCTTCA CTCACTACAA CGAGAAGCCC
GATTTCATCG AGGTCTTACG AAACACCTTC TTCCAAGACG AAACGTATGG CGATGCCCGC
GCGAAGTGGC ACCGGAACAC CACGTCATTT TACGCGTGGG TGAAGGCACA TATGCTCAGG
CTCGCATGGG ATTGCCGCGA GAGACTCCTC CACCGCTTCC TTCACTCGTT CCCCAACATT
TGCCGTGATT TTGGCTTCTC TGTTGCTCAC AACCGCGACA CCAGCGGAGC GCCGAGCCAG
TCCCGGCTCT GGGAGATGTG GAACAAAGAG TTCACCGACG TACAGCGTGA GTTCGTGCGT
ACAGCCACCG AGGAAGCCCT CGCCTTCGCC CGCGAACAAG GCATTCCTGC TCCTGACCCA
GTGTTCCGAC CCGAGGAACG GGACATCTCC TCGAAGCGGA GCGAACAGCG ACTCGTCGCA
GAGAAGACCA AGGAGGTCTG GCAACAGGCC AAACCGTTCG TCACGGACAC ATTCTACCTG
AAGCGAGCCG ACAATAGCGT AATCCACGAG AACGCGTTCT GGGAACAGCA CGCCTACATG
GGGATGCGCG AGAATATGTA CGCACAGAGC GGTCAACACT CCTTCTTCAT CGACTCCCAG
CGTGACCACA CACCGAGCGC ATCCAATCAT CGCTACCAGA TAGGGAAGCT CACCGTCGAG
GAGACACGCT CGATGCTCCA CGAGACAACC CGGATGCTCA TCGCCCGCGC TCGGCACAAT
TCCGAACTCG TCGGCAAACT TTGGGCCGCC ATCGACATTA CCAAGGGGAA TCCGTGGACT
GGTGAAATCG AGCGCGACGA GGACGACAAC ATCACGGAGG ACTGGATTCT CGGCTACAAG
GACGGCGAAG TGTACTATCA GTGGGCGACC ATCCAGATTG TGGGCTACGA CATTCCGCTC
GTACTGGACG CAATACCCGT CAAACGCGGG ATGAAGCGAG CCGACATCGT GGACAGCTTG
CTGGAGAATG CGCTTGACCT CGTAGACGAC ATCGAACTCG TGATGATGGA CAGAGAGTTC
GATAATGATG GCGTGAAGGA CGCGTGTGAT AAACACGGGG TCTACTACCT GAATGGCGCA
CGCAAACGCC AATCTGAAAG AGCGACGTGT ACACGGCTTC GCCGTGCCGG AAAGACCGTT
CACATCGAAG AGGAAACAGT CCCAGATGGT CCGACTCGCA AGCGGATGTT CCTCCCCTCC
AGCACGGACG ACCCTGATGC GGAGGATATG GAAGAGAGCA GCGAACCCGT AAAGGGGAGT
TCGGATGTCC GCGAAGAGAT GCGTGAAGAC CTCGCTGAAC TCGGTATCGA CCTAAACGAT
GACGACGACA GACGCGGTTT CGGTCCGGTC ATTGACGACC TCCGTGAGCA GGAGGCAAAT
GAGCCGACTG TCGGAAGCGA CGAGGATGCG CAGACGTATG CGTTGTTCGA GACGAACCAC
CCCAGCGTGA CGCTGAATGA CGACGACAGC GAGATAGAGC GCATTCACAT GGTTGAGCGG
ATGGTTCGCC GGTATCGCCA CCGCTGGGGC ATTGAGAATG GTTACAAGCA AATCAAGACG
TTTCGCGTCC GCACGACGAG TAAACGCCAC ACGTATCGGT TCTTCAATTT CGTGTTTGCG
TGCGTGCTGT ACAACGTCTG GCGGCTCGTG GACTTGCTGG TGAAACTCGC CACCGAGGGT
GAGAACACGA CGTATGCGCC GCGTGTGGAC GCGAACCAGT TCTTGACTGT GGCGAAGAAA
TACTACGGTC TCGACCCACC CGACTAA
 
Protein sequence
MAVCSHTESP FDHDRNFTFT VNDETYNGSI HFFTHYNEKP DFIEVLRNTF FQDETYGDAR 
AKWHRNTTSF YAWVKAHMLR LAWDCRERLL HRFLHSFPNI CRDFGFSVAH NRDTSGAPSQ
SRLWEMWNKE FTDVQREFVR TATEEALAFA REQGIPAPDP VFRPEERDIS SKRSEQRLVA
EKTKEVWQQA KPFVTDTFYL KRADNSVIHE NAFWEQHAYM GMRENMYAQS GQHSFFIDSQ
RDHTPSASNH RYQIGKLTVE ETRSMLHETT RMLIARARHN SELVGKLWAA IDITKGNPWT
GEIERDEDDN ITEDWILGYK DGEVYYQWAT IQIVGYDIPL VLDAIPVKRG MKRADIVDSL
LENALDLVDD IELVMMDREF DNDGVKDACD KHGVYYLNGA RKRQSERATC TRLRRAGKTV
HIEEETVPDG PTRKRMFLPS STDDPDAEDM EESSEPVKGS SDVREEMRED LAELGIDLND
DDDRRGFGPV IDDLREQEAN EPTVGSDEDA QTYALFETNH PSVTLNDDDS EIERIHMVER
MVRRYRHRWG IENGYKQIKT FRVRTTSKRH TYRFFNFVFA CVLYNVWRLV DLLVKLATEG
ENTTYAPRVD ANQFLTVAKK YYGLDPPD