Gene Nmag_0333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_0333 
Symbol 
ID8823154 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp323945 
End bp325066 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content66% 
IMG OID 
Productgas vesicle protein GvpN 
Protein accessionYP_003478485 
Protein GI289580019 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGAGG ACACCTCGCG CAAGCGCAAA GTCCGTGGCA GAAAGATTCG CGGTGACCGG 
GAGGCGAAGA AGCGCCTCAA GGCCCGAAAG AAGCTGGCTC GATCGGCATC GCAGACGAAG
ACAAAGAGCG AAACAGCCAG CAAATCGAGG GGCTCTCACA TCGCAACCGA GAGCGGAGAT
GACCACCTCA TAGACCCCGC CGACGCTGCA CCGGACCCCT TCGTCGAAAC CGACGCCGTC
GCTGCCGTTC GCGACCGGAT CACCGGGTGG CTCGCTGCCG ATCAGCCGGT TCACCTGATC
GGCCCGACCG GCGCTGGTAA GACGGCACTC GCACTGGCAG CGGCTGCAAC GCGCGGCCGC
CCGGTCGTCC TCTGCAACGG CGACGAGGCG GTCGACACGA GCGCGCTCGT CGGCGGCTAC
AGCGGCGGCG AACGCTACGA GGAGCGCGAC GAGTACGTCA GCGGCGTCAG CAAGAAGACA
CAGATCGTCC GCGACCGCTG GGTCGACAAC CCGCTCTCCG TCGCGGTCCG AGAGGGCGCA
ACGCTCGTCT ACAACGAGTT CTCCCGGAGC GACCCCGCCG CCCACAACGT CTTGCTCTCC
GTCCTCGAGG AAGGTGTACT CGAGCGGCCG GGCAAGCACG GGGCCAATAG GTCGATCGAC
GTGCATCCGG AGTTCCGCGT GATCTTCACG TCGAACGACG TGGAGTACGC GGGTGTCCAC
CAGCAACAGG ATGCACTGCT CGACCGGATG GTCGGCGTGC ACGTCGACTA CTACGACGCA
GAGACCGAGC GCGAAATCGT GCGGTCGCAC GTGGCCGTTT CCGACGAAGC GATCGAAACG
GTCGTCGACG CGACCCGGAC GCTGCGCGAG GAACTCCCGG TCGTCGTCGG GACGCGGACA
GCGATTACGG CCGCGAAGGG GATCTCGGTG TTCGACGACT GGAATGGTGA CGAGGCTGCA
CCCGAGCGGG CAGACGGCGG TCGGGTACAG GTCGACGGCG ATGACGACCT GCTCGCAGAC
GTGTTGACGG ATGTGCTCGG CCCGAAAGTC GCTGGAGCAG AGACGGAAAT CGATGGGATG
GCCGCCCTGC ACAGTCAGAT TAGCGAGGTA CTTCGGGACT GA
 
Protein sequence
MAEDTSRKRK VRGRKIRGDR EAKKRLKARK KLARSASQTK TKSETASKSR GSHIATESGD 
DHLIDPADAA PDPFVETDAV AAVRDRITGW LAADQPVHLI GPTGAGKTAL ALAAAATRGR
PVVLCNGDEA VDTSALVGGY SGGERYEERD EYVSGVSKKT QIVRDRWVDN PLSVAVREGA
TLVYNEFSRS DPAAHNVLLS VLEEGVLERP GKHGANRSID VHPEFRVIFT SNDVEYAGVH
QQQDALLDRM VGVHVDYYDA ETEREIVRSH VAVSDEAIET VVDATRTLRE ELPVVVGTRT
AITAAKGISV FDDWNGDEAA PERADGGRVQ VDGDDDLLAD VLTDVLGPKV AGAETEIDGM
AALHSQISEV LRD