Gene Nmag_2004 details


Gene Information

Locus tag: Nmag_2004
Symbol:
ID: 8824846
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natrialba magadii ATCC 43099
Kingdom: Archaea
Replicon accession: NC_013922
Strand:
Start bp: 2041339
End bp: 2043093
Gene Length: 1755 bp
Protein Length: 584 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: PKD domain containing protein
Protein accession: YP_003480137
Protein GI: 289581671
COG category:
COG ID:
TIGRFAM ID:
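
The coordinates and lengths listed above are consistent with each other, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
# Values copied from the record above.
START_BP = 2041339
END_BP = 2043093


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Amino-acid count, assuming an unspliced CDS with one trailing stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 1755 584
```

Both results match the Gene Length and Protein Length fields of the record.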


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.361042
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGTACT TCAGAATACT CGTCGTTGCA ATCGTGGTCG TATCCGTAGC CACAGCTGGT 
GGTATGGCTG TTGCAGTCGC ACAGGACGAT CCACCAGGAC CACCGGCAAG TTTCTACGGT
GAAGCCATCG ACGAAGATGG GAACGCTGCA GACAGTGATA CAACGATCGT CGCTGTCGTT
GAGGGAGAGG TTGAGGGACA GATTACTGTC GAAACCGCAG GTGAGTACGG TGGCCCTGAT
ACCTTCGACG AAAAACTCAG TCTCGACAGT GCTGCAGGTG ACGAGGTATC GTTCCACGCC
GGAAATGCAT CCGGTCCGGC AGCTCTCGAG AGTCCAGTCG ATCTAGACCC TGGACTGAGC
GAACGAGACC TCACGTTCCC GGCAGGGACG TTCGACGACA GTCCAGACGA CCCAGACGAC
GGCGATGATG AAGATAACGG TGACGACGCA GACGAGGGTG ACGACGCAGA CGAGGGTGAC
GACGCAGACA ACGGCGATGA TGAAGACAAC GGTGACGACG CAGATGTCGG TTCTGGGAAC
GTCGAGTTCC TCATTGAAAC AGACATCGAC AACGACCATG CATGCACTCA TGGGGAGTAC
GACGAGCGCA CACCGCTTGA AGCGGGCAAC TCACCTGACG ATGCACCTGT GGTTGTAGAA
GACCACGTAA TCTGGGATGT CACCTACGAG GGTGACGAAG GGTACATCCG GTTTGATAAC
ACCGAGCATT CTATCTATCC TGGTCTCGAC TCGTGGGTCT TCTACCAGGC CGGTGCCGAC
CTGTCGCCCA CCGACGGAAC CGTTCTCGAA ACTGGTGACG TAGATGAGTG TCCATCGCTT
GATGGGTACA TGGAGGTCGA GACACCATCC GATGGGGTCT TCGATATCGA AGTTACCGAA
GATGGCGACG TCACAGGACC TGCCAGCGAT CGAGAAGACG ATCAAGAAGA GCAACTCGAG
GCAGCCTTCG ATGTCGAGCC ATCGGAACCG GTTGTCAACG AGACTGTGGC GCTAAACGCC
AGCAACGCTA CGAGCGGTGA GGCAGGTATC GTTGCATACG AGTGGGTTGT TGCCGACACG
GAGTTGACCG GCGAACAGGT CAATACGACC TTCGATAGTC CTGGCGAGAT CGACATCGAA
CTGACAGTCG AGAACGATGA CGGTGACACA GATACGACCA ATAAAACGAT TCCCGTCTCG
ACGGAATCTG AGCCTCGTTT TGAGGTAACT GAGGTCGATA CTCCTGAGAC CGTCCCACCG
GGGACTCAAT TCAACGTAAC AGCGACGATT GTGAATACCG GTGAAAAGAA CGGAACGGCA
GCGGTTGCAC ACACATTCGA TGGCGAACCA GAGACCGAGC GGACGATTGA ACTCGCCAGC
AACGAGACCG GCGTCGTTCC CTTCAACGTC AGTGCACCAG AGACAGAAGG AGAATACCGG
CACACGATTA GCACACCCAT CCACAACAAA TCAGTGACAA CAGCAGTCGA AAAGACGAGC
GGAGCAGACG AGGAAACGGA CCCAGAATCA GAAGACGGAC CTGAACCCGA AGAGGAAGGG
GACGGTGAAG AAAATGGTGT AGAAACGGAC GAAACAGCCG AAGCGGAATC GGATGCAGAG
GATGAGACTG ATGATTCCGT TCCTGGCTTT GGTATTGTGA CCATGGCGGG AGTCGTTCTC
GTGCTCATCG TTTCTTTCCA TAGCCAGCAG CGGATTAGGC GTAGAAGGCA GTCACTTATT
AAAAACTCCT TCTGA
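
The gene sequence is displayed in space-separated groups of 10 bases, so it has to be joined before any length or GC calculation. A minimal sketch follows; the helper names are hypothetical, and only the first and last rows of the sequence are included as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence display."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last rows of the gene sequence shown above.
first_row = "ATGAAGTACT TCAGAATACT CGTCGTTGCA ATCGTGGTCG TATCCGTAGC CACAGCTGGT"
last_row = "AAAAACTCCT TCTGA"

print(len(clean_sequence(first_row)))             # 60 bases per full row
print(clean_sequence(first_row).startswith("ATG"))  # True: ATG start codon
print(clean_sequence(last_row).endswith("TGA"))     # True: TGA stop codon
```

Passing the complete sequence block to `clean_sequence` and then to `gc_fraction` gives a value that can be compared with the GC content field of the record.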
 
Protein sequence
MKYFRILVVA IVVVSVATAG GMAVAVAQDD PPGPPASFYG EAIDEDGNAA DSDTTIVAVV 
EGEVEGQITV ETAGEYGGPD TFDEKLSLDS AAGDEVSFHA GNASGPAALE SPVDLDPGLS
ERDLTFPAGT FDDSPDDPDD GDDEDNGDDA DEGDDADEGD DADNGDDEDN GDDADVGSGN
VEFLIETDID NDHACTHGEY DERTPLEAGN SPDDAPVVVE DHVIWDVTYE GDEGYIRFDN
TEHSIYPGLD SWVFYQAGAD LSPTDGTVLE TGDVDECPSL DGYMEVETPS DGVFDIEVTE
DGDVTGPASD REDDQEEQLE AAFDVEPSEP VVNETVALNA SNATSGEAGI VAYEWVVADT
ELTGEQVNTT FDSPGEIDIE LTVENDDGDT DTTNKTIPVS TESEPRFEVT EVDTPETVPP
GTQFNVTATI VNTGEKNGTA AVAHTFDGEP ETERTIELAS NETGVVPFNV SAPETEGEYR
HTISTPIHNK SVTTAVEKTS GADEETDPES EDGPEPEEEG DGEENGVETD ETAEAESDAE
DETDDSVPGF GIVTMAGVVL VLIVSFHSQQ RIRRRRQSLI KNSF