Gene Nmar_1516 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1516 
Symbol 
ID5773066 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp1378731 
End bp1380293 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content36% 
IMG OID641317167 
Productvon Willebrand factor type A 
Protein accessionYP_001582850 
Protein GI161529024 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.00796283 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCAATCAG TTAAGCTTCA AAACGAATCA CTAGTAGAGA TTGCCACATT TCTTGTAAGA 
CGATGGTCTG AAAGAGACAA CATTGTTGTA GAAATCTCAG ACAAAACTGA AACAAAAACA
AGACTAAAAG AAAACAAGGT AATTCTTACA CCGCTAGAGA AAAGAGTAGG AAACGATTTT
CAAAAGTACA GACAGTTTAG AACATCACTA TGGTATGAGG CAATGAGAAT AAAGTTCTGC
AAGAAAATTC TCAGCAATGA TCATGCATTT GGATTCATCC TAAACACAAT GGAGACAAGA
CGTGTAGAGG AACTAGGAAG AAAGATTTGG AAAGGAATGG ATGATGAAAT CATCTTCAAT
TACGCCTACA TGCTTGTGGC CAGACCTCAA TTGCACACAG TTTATGGAAA AGCAAGGATT
GTTGAGGCAT TCTATCAATA TTTCATGTTT GGAGCAGTCA AGGGAGAAGT TCAGTCTAGT
CATTTTGAAA AAATTAGAAA GGCAGATGCA TTTGCAAAAA AAATGGTAAG CAAAGCAATT
GAAGAAAATC ACGACACAGA TTGGCTTGAA AAAAATGTCA GTGAAATCAT AAAAATTCTA
GAGATTGATT CTCTACTAAC AATTCCAGTA TCACTACCAT TTATGAAAGC AGGAATGCCA
CTTTCTGAAG AAGAACTGCT AAGAGTCTTG AAGATAGTTT CCAAAAACAA AGAAGGAGAC
TTTGGCAAAG TAGATCCTTC TGCAATATTG AAGGGAGAAG ATGTAATTGA TGAGTATAAT
GTTTTACTTG ATGAAGACAA GAAAACAGAG AACAAGGGAC TGATGCCTGA AGCAATAGGA
ATCCAAATCC CAACTACAAG AAATGTAGAT GAGACTGTAA TCTATGACAT GAGTTTGATT
AATGGACTAA AAACAAAATT CAAAGAATGG AAGACAGGTT GGAAGGAACA ACATGTCAGA
TCAGGAGAAG AGTTTGATGA GGAAAACTAC ATTGAAGGAA ATGAACCATT CTTTACAGAT
ATTAAAAAAT CAATCAAAAC AAAAATTGTC ATACTGTTAG ATCATTCATC TAGTATTTCG
TCGGATGCAA TTGAATACAA AAAAGCAACG CTTGCACTTT GCGAAGTCTT GGCATATCTC
AAAGTAAAAT TTGCAGTCTA TGCGTTTAGT ACAGAAAACA GATCAGTTGT TTGTTGGTCC
ATAAAACCAG ACAACATGAA ATGGAATAAC GTTACTGCAA AAAGATTGGC ACAAATAGTT
GCAAACGGTT CTACACCACT AGCTGAAGTG TATGACAAGA TGTTTCCAAT CTTACAATCA
AAGAGACCAG ACATCCTCTT GACATTGACT GATGGTGAGC CATCAGACCC TGATGCAGTC
AGAAACATGA CAAAATCACT CAAAAGTCTA GGCATAAGTA TGGTCGCCTT AGGCCTGGGA
CCAAATACTG TAAGGGCAAC AACTATTGCA AACAATCTAA GACATTTGGG GTATGAAAAA
ACAATGGCAG TAAGCCGTCT AAGAGATATT CCAAACAAGG TAATCAAGAT TTTAGATATC
TAG
 
Protein sequence
MQSVKLQNES LVEIATFLVR RWSERDNIVV EISDKTETKT RLKENKVILT PLEKRVGNDF 
QKYRQFRTSL WYEAMRIKFC KKILSNDHAF GFILNTMETR RVEELGRKIW KGMDDEIIFN
YAYMLVARPQ LHTVYGKARI VEAFYQYFMF GAVKGEVQSS HFEKIRKADA FAKKMVSKAI
EENHDTDWLE KNVSEIIKIL EIDSLLTIPV SLPFMKAGMP LSEEELLRVL KIVSKNKEGD
FGKVDPSAIL KGEDVIDEYN VLLDEDKKTE NKGLMPEAIG IQIPTTRNVD ETVIYDMSLI
NGLKTKFKEW KTGWKEQHVR SGEEFDEENY IEGNEPFFTD IKKSIKTKIV ILLDHSSSIS
SDAIEYKKAT LALCEVLAYL KVKFAVYAFS TENRSVVCWS IKPDNMKWNN VTAKRLAQIV
ANGSTPLAEV YDKMFPILQS KRPDILLTLT DGEPSDPDAV RNMTKSLKSL GISMVALGLG
PNTVRATTIA NNLRHLGYEK TMAVSRLRDI PNKVIKILDI