Gene Nmag_1930 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_1930 
Symbol 
ID8824771 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp1962303 
End bp1964261 
Gene Length1959 bp 
Protein Length652 aa 
Translation table11 
GC content57% 
IMG OID 
Productorc1/cdc6 family replication initiation protein 
Protein accessionYP_003480063 
Protein GI289581597 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGACG ACGAAACGGA ACAAACAAAT TCGAACACGG GCACAGATAC CGATTCACAC 
AGCTCACAGC GGGGCGACCG TGCGAGCAGT GATCGTTCCT CTCACCAGTC CGACTTATCT
TCTGACGCGA ACTCGAACTC GAACTCGAAC TCAGACTCTA CGTCGAACCG ACCAGTGAAC
TCGAGTGCGA ATGCGGACGG TCCCAGAAAC AGTACTAGTA CCAGTCCCAG TACCAGTACT
AATTCCAATA CTGCCGGATC TGACAACACT CACGGTTTCT CGACCGATTT CGGTGACACC
GACCTTGGTG ACGACGGTGA GGACTCAAAC CAGGGCCTGT TCGACGACCT CCTGAGTGGC
GAACCAATCT TCGAGAACAA GGAGGTCCTT CGGCCATCGT ACACACCCCA CGAACTTCCC
CACCGAAGCG ACCAGATCAA CAAGATGGCG ACCATCCTGG TCGCCGCGCT TCGCGGCGAA
ACGCCGTCAA ACATCCTCAT CTACGGGAAG ACCGGGACGG GGAAGACCGC GAGCGCGAAG
TTCGTCAGCA AAGAACTCGA GAGTACTTCC CAGAAGTACA GCGTCCCGTG TGACGTCGAG
TACATCAACT GCGAGGTAAC CGACACGCAG TATCGCGTGC TCGCCCAGCT TGCGAACAAG
TTCATCGAGA AGAACAAAGC ACGAATCGAC GACCAGATCG AATCCTTACA GGCGCTCCGT
GAGGAGGTTG CGGCGTACGA CGAGGCATCA GAACGCCAGG GGAGCACCGA CACACCGTCG
GCGACCCAGT CCGAATCCGA ACACAACAGC AACATTTCGA CTGGAAACGA TCCCGATCAG
ATGGCATCTG AGGAGGGCAA CAGTTCACAT ACTCCAGTGG AAAATGGGGG GTACAGTAAC
AGTAGTGGCG CTTCGGAACG GCGCGAACGC TCGGAGAACG CACAACAATC GCACGGAACG
CCACCGCGGC AAACAGCGAA TCAGCCACCA CACCCACTCG AGGAGACGGC ATTCGAATCG
GTCGCTGACA TCGACGCGCG GATCGAGTCT CTGCGGGAGG ACAAGGACTC GTTCGAAGAG
GTGCCGATGA CCGGGTGGCC CACGGATCGC GTCTACAGCG TCTTCTTCGA TGCTGTCGAC
TACGACGAGC GGGTGGTCGT CATTATGCTC GACGAAATCG ACAAACTCGT CGAAAAGAGC
GGCGACGACA CGCTCTACAA TCTTTCGCGA ATGAACTCCG AACTCGAGAA CTCTCGCGTC
TCGATCATCG GCATTTCGAA CGACCTCAAG TTCACTGACT TTCTCGACCC ACGCGTGAAG
TCCAGTCTGG GTGAGGAGGA GATCGTTTTC CCACCGTACG ACGCCAACCA GCTCCGGGAC
ATTCTCCAGC ATCGTTCCGA AGTCGCGTTC AAAGGGGGCG CGCTGTCTAC AGACGTCATC
CCGTTGTGTG CGGCCTTCGC TGCACAGGAA CACGGGGACG CACGCCGCGC ACTCGATCTC
CTTCGGACGG CGGGCGAACT CGCAGAACGT TCGCAAGCCG AAACCATCGT CGAAGAGCAC
GTCCGCCAGG CACAGGACAA GATCGAACTC GACCGTGTGG TCGAGGTTGT TCGGACCCTC
CCAACCCAGA GCAAACTGGT CCTGTTTGCA ATCATCCTCC TCGAGAAGAA CGGTGTACAC
AGCATCAATA CGGGCGAGGT GTTCAATATC TACAAGCGTC TCTGTGAGGA GATCGACGCT
GATGTACTCA CACAGCGCCG CGTCACGGAC CTGATTAGCG AACTCGATAT GCTCGGGATC
GTCAACGCCG TAGTCGTCTC CAAGGGACGG TACGGCCGAA CGAAGGAGAT CAGCCTCTCG
GTGCCGATCG ACGAGACGGA GGCCGTACTG CTCTCTGACT CTCGGCTTTC GGATATCGAC
GATATTCAGC CGTTCGTGCA GGCGCGCTTC GAGAACTAA
 
Protein sequence
MSDDETEQTN SNTGTDTDSH SSQRGDRASS DRSSHQSDLS SDANSNSNSN SDSTSNRPVN 
SSANADGPRN STSTSPSTST NSNTAGSDNT HGFSTDFGDT DLGDDGEDSN QGLFDDLLSG
EPIFENKEVL RPSYTPHELP HRSDQINKMA TILVAALRGE TPSNILIYGK TGTGKTASAK
FVSKELESTS QKYSVPCDVE YINCEVTDTQ YRVLAQLANK FIEKNKARID DQIESLQALR
EEVAAYDEAS ERQGSTDTPS ATQSESEHNS NISTGNDPDQ MASEEGNSSH TPVENGGYSN
SSGASERRER SENAQQSHGT PPRQTANQPP HPLEETAFES VADIDARIES LREDKDSFEE
VPMTGWPTDR VYSVFFDAVD YDERVVVIML DEIDKLVEKS GDDTLYNLSR MNSELENSRV
SIIGISNDLK FTDFLDPRVK SSLGEEEIVF PPYDANQLRD ILQHRSEVAF KGGALSTDVI
PLCAAFAAQE HGDARRALDL LRTAGELAER SQAETIVEEH VRQAQDKIEL DRVVEVVRTL
PTQSKLVLFA IILLEKNGVH SINTGEVFNI YKRLCEEIDA DVLTQRRVTD LISELDMLGI
VNAVVVSKGR YGRTKEISLS VPIDETEAVL LSDSRLSDID DIQPFVQARF EN