Gene Nmag_3931 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_3931 
Symbol 
ID8826801 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013923 
Strand
Start bp334625 
End bp335818 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content66% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003482034 
Protein GI289583624 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGACACT CACGCACAGC CGCAGTTACC GGGATCGGAT TACTCCCGAA CGGAACGCAC 
TCGAGTCCCG AACGAGAACT CGCACTGACT GTCCTTCATG ACGCGCTGGC GGACGCGAGA
TTGTCGCCCG AAGCGATCGA CGGGCTCTAC ATGCCCGCGC CGCGGCCGTG GGCGGCGCAG
AAGTTCGTCT CGACCACGCT CGTCCACAGA CTCGGCATCG AACCCGACCG GACGCTCGAG
GTGTCGACCG GCGGCTCGAG TAGCGCAAAC GCGTTTCAGA CCGCTGTCCA CGACGTTCGA
CACGGCGTCG TCGACACGGC GGTCGTCCTC GCCGCCGAAC GGAGTTCGAT CGTTGAGACG
ACCGGTCCCT ACTTCGAGTA CATCCTCAGC ACGTTCGACG CGGAGTTCGA ATCCCCAATC
GGGCTATCGG TGCCGGGGGC CTACGCCCAG AGCATGCAGC GGTACTGTTA CGAACACGAT
ATCGACCGCG ACGATATCGC CGACATCGTC GTGAAAAACC GCGAGAACGC GGCCGACGAT
CCGGACACCC TGTTCAGCGA CGGGGTCGAT CGGGTGGACG TACTCGAGTC CCGGCCGATC
GCAGAGCCGA TTCGGCTGTA CGACTGTCCG GCACCGTGTG ATGGGGCGGC GGCGCTGGTG
GTGACAGCCG ACGATGGTGG GGAGACGGAC ACAGAATCTG GAAACGGTGG CGACCCACCG
ATCACGGTCG CCGGAGTCGG CAGCCACCAC GCGGCGAGTC ACTTCCTGCA GACCCACGGC
GAGCCGATCA CCGAACTCCC CGCGGTTCGG CGAGCAGCCC GGACGGCGAG CCAGGAGGCC
GGACTGGCAC CAGATGAGCT GGACGTCTAC GAGCCGTACG CGCCGTTTCC GCACATCGAG
GCGATCATCA CCGAGGAACT CGGTCTGGTC GACCGCGGGG AGGGCGTCAC AGCGTGTCTC
GACGGTCAAA CGCGACCTGA CGGTTCGTTC CCGATCAGCC CCTCCGGCGG CTGTCTCGGC
CGGGGCCACC CGCCGATGGT AACGCCGTTG TACAACTACG TCGAGGCCGT CCGCCAGCTC
AGGGGAACGG CCTCGACGCA GATTGTAGAC GCCGAGCACG TCATGACGAC CGCAGAGCAC
GGCCACGTCA ACGGCGCGAC CGCCACCGTC TTCGCGAGAG GGAGGGGTGC GTAG
 
Protein sequence
MGHSRTAAVT GIGLLPNGTH SSPERELALT VLHDALADAR LSPEAIDGLY MPAPRPWAAQ 
KFVSTTLVHR LGIEPDRTLE VSTGGSSSAN AFQTAVHDVR HGVVDTAVVL AAERSSIVET
TGPYFEYILS TFDAEFESPI GLSVPGAYAQ SMQRYCYEHD IDRDDIADIV VKNRENAADD
PDTLFSDGVD RVDVLESRPI AEPIRLYDCP APCDGAAALV VTADDGGETD TESGNGGDPP
ITVAGVGSHH AASHFLQTHG EPITELPAVR RAARTASQEA GLAPDELDVY EPYAPFPHIE
AIITEELGLV DRGEGVTACL DGQTRPDGSF PISPSGGCLG RGHPPMVTPL YNYVEAVRQL
RGTASTQIVD AEHVMTTAEH GHVNGATATV FARGRGA