Gene Nmag_3999 details

Gene Information

Locus tag: Nmag_3999
Symbol:
ID: 8828733
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natrialba magadii ATCC 43099
Kingdom: Archaea
Replicon accession: NC_013924
Strand:
Start bp: 42177
End bp: 43526
Gene Length: 1350 bp
Protein Length: 449 aa
Translation table: 11
GC content: 50%
IMG OID:
Product: Capsule synthesis protein, CapA
Protein accession: YP_003482094
Protein GI: 289937492
COG category:
COG ID:
TIGRFAM ID:
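
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of Python. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends with a single stop codon; the function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length):
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length // 3 - 1


print(gene_length_bp(42177, 43526))  # 1350
print(protein_length_aa(1350))       # 449
```

Both values agree with the Gene Length and Protein Length fields reported in the record.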


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0396426
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAAAGA ACCCATTAAC ATTCACAGCC GTTGGTGATG CGATTGTCAC ACAGAAGTTT 
TCAGTTTATG AGGAGGAGTC ATTTAATGAG CTAATTGATC AGATACAAGA TCAAGATGTT
TCCGTTGCAA ATCTAGAAGT CCTTCTCCAC AATTTCGAAG GCTATCCTGC TGCTCAGAGT
GGGGGAACAT ACATGCAAGC CCCTCCAGAG ATTGCAGATG AACTCGAGTG GGCAGGATTT
AACCTGTTCT CTGCCGCGAC AAACCACGCG GGAGACTTCT CACACGGGGG GATGGAGGCA
ACAATGCAAG CCCTCGAGGA GCGGAACATG AGTTATGCCG GCATGGGGCG GAATCTTGCA
CAAGCTCGTG CCCCGACGTA CCTGGATACC CCAAAAGGAC GGGTAGCGCT TATCTCAGCA
TGTACAACAA TTACAACAGG AACGGAAGCA GGACTCCAAC GGCCGGATAT GCAGGGCCGA
CCTGGAATTT CCCCTCTTCA TCTCCAGACA CGGTATACAG TTCCTGAGGA GTTCCATGAG
GAGCTTATCC ACGCCAGCAA GAAGCTTGGT CTCGAAGCAA TCAAAGACCG GAAACGGGAG
CTCGGGTTCC AGGTTCCAGG TGAGGATAGT GACGGGTTTA CGTTTCTCAA TATAGGAGGA
GAGACAGATA TACAGTTTGA GTTGGGCGAT CGTTTCGATA TCCATCAGGA GGTCAATGAC
GAAGATGCAG AGTCAATTAC GAAGCAAATT CAGGCCGCGA AGCGCCAAGC AGATTGGGTA
TTTATCAGTC TTCATTCACA TGAAGGGACG GGTGGGTCTC GCAATGACGA TACTGTCCCA
CAGTTTTTGG AATCGTTCGC AAGAAACTGT ATTGATGCTG GTGCTGATGG ATTCATTGGA
CACGGTCCAC ACGTTCTTCG AGGGATAGAG ATCTACAGGG GAGCGCCGAT TTTCTACAGT
CTCGGGAATT TCTTCATGCA AAACGAGACA ATCCCGAACC TACCTGCAGA GATCTACGAT
CGGTATGACC TCGACCCCTA CCAGTCATTG CCAGCTGATC TTTTTGATGA GCGAATCTTC
AACGATGAAC AGCAGCGCCA AGGATTCACG GCTGATCGGA AGTTTTGGGA GTCAGTACTT
CCAATCTGCG AATTCGGTGA AGATGGTGTT GAGTCAATTG AACTTCTCCC CTTAGATCTT
GGGTATGAGC GGTCGCGGCC TCAACGTGGT CGTCCAATGC TCGCTGGGCC AGATGTTACT
GATCACGTCT TCGCAACTGT CAACGAACTT TCGTCGCAAT ATGGAACCGA ATTCACGGAA
GATGGGCCTG TTCTCAGAGT CGATCTGTAG
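
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be removed before any computation. The sketch below shows one way the GC content figure could be recomputed from such a listing. It assumes GC content is defined as the count of G and C divided by the total sequence length; how the record's own 50% value was calculated and rounded is not stated, and `gc_percent` is a hypothetical helper name.

```python
def gc_percent(block_formatted_seq):
    """Percentage of G and C bases in a whitespace-separated sequence listing."""
    # Join the 10-base blocks and lines into one continuous string.
    seq = "".join(block_formatted_seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Toy input, not the gene above: 4 of the 8 bases are G or C.
print(gc_percent("ATGC ATGC"))  # 50.0
```

Passing the full listing above as a multi-line string works the same way, since `split()` discards both the spaces between blocks and the line breaks.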
 
Protein sequence
MQKNPLTFTA VGDAIVTQKF SVYEEESFNE LIDQIQDQDV SVANLEVLLH NFEGYPAAQS 
GGTYMQAPPE IADELEWAGF NLFSAATNHA GDFSHGGMEA TMQALEERNM SYAGMGRNLA
QARAPTYLDT PKGRVALISA CTTITTGTEA GLQRPDMQGR PGISPLHLQT RYTVPEEFHE
ELIHASKKLG LEAIKDRKRE LGFQVPGEDS DGFTFLNIGG ETDIQFELGD RFDIHQEVND
EDAESITKQI QAAKRQADWV FISLHSHEGT GGSRNDDTVP QFLESFARNC IDAGADGFIG
HGPHVLRGIE IYRGAPIFYS LGNFFMQNET IPNLPAEIYD RYDLDPYQSL PADLFDERIF
NDEQQRQGFT ADRKFWESVL PICEFGEDGV ESIELLPLDL GYERSRPQRG RPMLAGPDVT
DHVFATVNEL SSQYGTEFTE DGPVLRVDL
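
The protein sequence can be related back to the gene sequence by translating it codon by codon. The following is a minimal sketch, assuming the amino-acid assignments of NCBI translation table 11 (the table named in the record), which are the same as the standard code for elongation codons. Alternative start codons are not handled, which does not matter here because the gene begins with ATG. `translate` and `CODON_TABLE` are illustrative names.

```python
from itertools import product

# Amino acids in NCBI order: codons enumerated over T, C, A, G,
# with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(block_formatted_dna):
    """Translate a whitespace-separated DNA listing up to the first stop codon."""
    seq = "".join(block_formatted_dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGCAAAAGA ACCCATTAAC ATTCACAGCC"))  # MQKNPLTFTA
# Last 30 bases give the final 9 residues followed by the TAG stop.
print(translate("GATGGGCCTG TTCTCAGAGT CGATCTGTAG"))  # DGPVLRVDL
```

Both fragments match the start and end of the protein sequence listed above.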