Gene Nmag_2822 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_2822 
Symbol 
ID8825678 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp2895149 
End bp2897485 
Gene Length2337 bp 
Protein Length778 aa 
Translation table11 
GC content63% 
IMG OID 
ProductSigma 54 interacting domain protein 
Protein accessionYP_003480938 
Protein GI289582472 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAACG ATACGAACGT TGACGACCCT CCCGAGGACG CCTCCGGAGC TGCGCCGGAC 
GAGGAACAGG CGCAGGAGTC GGAGCGGGAG CCCGAGCGCA CGGAGGAGCG TTCGGAGTCT
CATGCCCCCG CATCCAACTC GCAGTCTACG GAGCAAGAGC GTCAGGGAGA CCGGTCGCCG
ACCGATGAAA CCGGTGACCC AGATGATGTC GACGGCGGAA CTGATGTCAC CAGCGACGAG
GGCGAACAGG ATAGTACGAT TACCGTCGGA AACGACGGCC TTGGAACCAG TGAGTCCGAC
GAGCGGGATG ATGCCGGTTC TACCGGTTCT GGCGGTGCTG TCGATAGTAC TGACCGTGGC
GACGACGACA TTGAGACCGT CGAAGACCTC GGTAGTACGG TCGAAGTCGA TCCAGGTGTC
GAAGTAGACG AGGAGATTGC CGAAGACGAC CTCCTCGGTG GTCTCCAGAT CGATTCGACC
GAGGACATCG AGGTCCCCGA CCGACTCGTC GATCAGGTCA TCGGGCAGGA CGAAGCGCGG
GATATCATCA TCAAGGCAGC AAAGCAGCGC CGGCACGTGA TGATGATCGG TTCCCCGGGG
ACTGGCAAGT CGATGCTGGC GAAGGCGATG AGTCAGCTCC TGCCACAGGA GGACCTGCAG
GATGTCTTGG TCTATCACAA CCCGGACGAC GGCAACGAGC CGAAGGTTCG CACCGTCCCA
GCAGGGAAAG GTGAACAGAT CATCGACGCG CACAAGGAGG AAGCGCGAAA GCGCAACCAG
ATGCGCTCGA TCCTGATGTG GATCATCATC GCGATCATCA TCGGCTACAC GATCCTCAGC
CCGGCGAGTA TCCTGATGGG GATCATCGCA GCTGGTGTTA TCTGGCTGAT CTTCCGCTAC
ACCAGCCGCG GCACGGACGC AATGGTGCCG AACATGATCG TCAACAACGG CGAGCAGCGC
CAGGCACCGT TCGAGGACGC GACCGGCGCT CACGCCGGCG CGCTGCTGGG CGACGTTCGT
CACGACCCGT TCCAGTCCGG TGGGATGGAG ACGCCATCTC ACGACCGCGT CGAACCGGGT
TCGATCCACA AGTCCAACAA GGGCGTGCTG TTCGTCGACG AGATCAACAC GCTCGACGTG
CGCACCCAGC AGAAGCTGAT GACGGCGATC CAGGAAGGCG AGTTCTCGAT CACCGGCCAG
TCCGAGCGTT CCTCGGGCGC GATGGTCCAG ACGGAGCCCG TCCCCTGTGA TTTCGTCATG
ATCGCTGCAG GGAACTTAGA CGCGATGGAG AACATGCACC CCGCACTCCG CAACCGTGTC
AAAGGATACG GGTACGAGGT CTACATGGAC GACACCATCG AGTCCACGCC GGAGATGCGC
CGGAAGTACG CCCGGTTCAT CGCCCAGGAG GTCGAACGCG ACGGTCGCCT GCCACACTTC
ACCCGTGACG CCGTCGAGGA ACTGCTCCTC GAGGCCAAGC GCCGCTCGGG CCGGAAGAAC
CACCTGACGC TGCACTTCCG CAGCCTCGGT GGACTTGTCC GCGTCGCTGG CGACATCGCC
CGCGCCGAGG ACCGCGACCG CACGACTCGC GACGACGTGC TCCAGGCCAA GCAGCGCTCC
CGGTCGATCG AACAGCAGCT GGCCGACGAC TACATCGAGC GCCGCAAGGA CTACGAACTG
CAGGTCACCG ACGACGGCGT CGAAGGCCGC GTCAACGGCC TCGCAGTCAT GGGCGAAGAC
TCGGGGATCA TGCTGCCCGT CATGGCAGAG ATCGCGCCCG CACAGGGTGG CGGTCAGGTC
ATCGCCACCG GGAAGCTCAA GGAGATGGCC GAGGAGTCCG TCCAGAACGT TTCGGCGATC
ATCAAGAAGT TCTCCGACGT TGACCTCTCG GAGAAGGACA TCCACATCCA GTTCGTCCAG
GCCGGCCAGC AGGGTGTCGA CGGAGACTCC GCCTCCATCA CGGTGGCAAC CGCCGTCATC
TCCGCACTGG AGGACATCCC GGTCAACCAG TCGGTCGCGA TGACCGGTTC GCTGTCGGTC
CGTGGCGACG TGCTCCCGGT CGGTGGGGTG ACCCACAAGA TCGAAGCCGC CGCCAAGGCC
GGCTGTAGCA AGGTCATCAT TCCGAAGGCC AACGAGCAGG ACGTGATGAT CGAAGACGAG
TACGACGAGA TGGTCGAGAT CATCCCCTGT GAAAACATCA GCGAAGTGCT CGATGTCGCC
CTCGAGGGCG AACCGAAGAA GGACTCGCTC GTCGACCGCC TGAAGTCGAT CACCGGCTCG
GCGTTCGACC AGCAGACGGT CGGCTCCGCA AGCGGGTCGA ACCCAAGTCC ACAGTAA
 
Protein sequence
MSNDTNVDDP PEDASGAAPD EEQAQESERE PERTEERSES HAPASNSQST EQERQGDRSP 
TDETGDPDDV DGGTDVTSDE GEQDSTITVG NDGLGTSESD ERDDAGSTGS GGAVDSTDRG
DDDIETVEDL GSTVEVDPGV EVDEEIAEDD LLGGLQIDST EDIEVPDRLV DQVIGQDEAR
DIIIKAAKQR RHVMMIGSPG TGKSMLAKAM SQLLPQEDLQ DVLVYHNPDD GNEPKVRTVP
AGKGEQIIDA HKEEARKRNQ MRSILMWIII AIIIGYTILS PASILMGIIA AGVIWLIFRY
TSRGTDAMVP NMIVNNGEQR QAPFEDATGA HAGALLGDVR HDPFQSGGME TPSHDRVEPG
SIHKSNKGVL FVDEINTLDV RTQQKLMTAI QEGEFSITGQ SERSSGAMVQ TEPVPCDFVM
IAAGNLDAME NMHPALRNRV KGYGYEVYMD DTIESTPEMR RKYARFIAQE VERDGRLPHF
TRDAVEELLL EAKRRSGRKN HLTLHFRSLG GLVRVAGDIA RAEDRDRTTR DDVLQAKQRS
RSIEQQLADD YIERRKDYEL QVTDDGVEGR VNGLAVMGED SGIMLPVMAE IAPAQGGGQV
IATGKLKEMA EESVQNVSAI IKKFSDVDLS EKDIHIQFVQ AGQQGVDGDS ASITVATAVI
SALEDIPVNQ SVAMTGSLSV RGDVLPVGGV THKIEAAAKA GCSKVIIPKA NEQDVMIEDE
YDEMVEIIPC ENISEVLDVA LEGEPKKDSL VDRLKSITGS AFDQQTVGSA SGSNPSPQ