Gene Nmag_2006 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_2006 
Symbol 
ID8824848 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp2044653 
End bp2045768 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content58% 
IMG OID 
ProductABC-3 protein 
Protein accessionYP_003480139 
Protein GI289581673 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.522018 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGCTG ACGAAAACGG GCCCATTGCT GCCGGTGAAC CGGCCGAATG GAGTCGCAGT 
CGTTTCGAAC AGTGGAGCGG CTACTCACTG CGCAAGCTGA TCGAACTGGT CGGTGCGGTT
GTCACGATTG GCCTCGCCGT CGCCATGCTC GGATTCATCA CGCTCGATTG GCTTCGGTTC
GCACCGGAGT GGGCAGTCAT TGGCTCCTAC GCGGAGTTGT TGCTTGGGCT GTTCCTGACT
GGCGGAGCGT GGCTGGATAC GTCCCTGGGA ACGAACGTGT TCCAGTACTT CTTCACGTGG
CGGGTAATCG CAACGGGTGT CCTCGTCGGG ATCGCTGCGC CACTCATCGG AACGTTTCTG
ATCCATCGAC AGATGGCCCT TATCGGCGAA ACGCTCGCAC ACACAGCGTT TACCGGTGTT
GCTATCGGAG TACTACTCGT CGCTGTTACC GGCTGGACTG GATCTCTCTT GTTCGTCGCA
CTTATCGTGA GTGTACTCGG TGCGCTTGTA CTCCAGTGGT TGACCGAACA CACCGCGGCC
TATGGCGACG TCCCCATCGC AATCGTCCTC AGCGGGAGTT TCGCAATCGG AACACTGCTC
GTCAGTTGGA GCCGAGATTT CGCTTCGGTG TCGCTCAATA TCGAGGGGTT CCTCTTTGGC
AGCCTCGCAA TTATCACTGC CGAAGGCACG CGGATGGTCG CCATACTCAC CGTTGCCGTC
GTTGCCGTTG TCGCGGTCAC CTACAAGCAA CTGCTGTTCA TCACGTTCGA CGAGCAGGCT
GCCCGCGTTG CGCGGCTCAA CGTCGACCGT TACAACACGC TGCTAATTGG GATGGCTGCA
GTCATCGTCG TCGGTGCGAT GCAAATCCTC GGTGTTATTC TCGTTGCGGC GATGCTCGTT
ATCCCGGTCG CAACGGCCTC ACAGATTGCC AACAGCTTCC GGGAAACGTT ATTGCTCTCT
GTCCTGTTTG GACAGGGTGC AGTTCTCGGC GGGCTAGCGT TTTCGATCAG AACGAACCTC
CCGCCTGGCG GTTCAATTGT CGTCGCCGGA ATCGTCTTTT ACGGGCTTAC TATCGTCCTC
TCAGACCGAT CCGCGGTTGC AATCTCTACA CACTAA
 
Protein sequence
MSADENGPIA AGEPAEWSRS RFEQWSGYSL RKLIELVGAV VTIGLAVAML GFITLDWLRF 
APEWAVIGSY AELLLGLFLT GGAWLDTSLG TNVFQYFFTW RVIATGVLVG IAAPLIGTFL
IHRQMALIGE TLAHTAFTGV AIGVLLVAVT GWTGSLLFVA LIVSVLGALV LQWLTEHTAA
YGDVPIAIVL SGSFAIGTLL VSWSRDFASV SLNIEGFLFG SLAIITAEGT RMVAILTVAV
VAVVAVTYKQ LLFITFDEQA ARVARLNVDR YNTLLIGMAA VIVVGAMQIL GVILVAAMLV
IPVATASQIA NSFRETLLLS VLFGQGAVLG GLAFSIRTNL PPGGSIVVAG IVFYGLTIVL
SDRSAVAIST H