Gene Nmar_1691 details

Gene Information

Locus tag: Nmar_1691
Symbol:
ID: 5774519
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 1550111
End bp: 1551889
Gene Length: 1779 bp
Protein Length: 592 aa
Translation table: 11
GC content: 40%
IMG OID: 641317345
Product: H+transporting two-sector ATPase alpha/beta subunit central region
Protein accession: YP_001583025
Protein GI: 161529199
COG category: [C] Energy production and conversion
COG ID: [COG1155] Archaeal/vacuolar-type H+-ATPase subunit A
TIGRFAM ID:
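
The coordinates and lengths listed above can be cross-checked with simple arithmetic: the inclusive span from Start bp to End bp should equal the gene length, and the protein length should be the number of codons minus one for the stop codon. The sketch below assumes 1-based inclusive coordinates and a CDS that ends with a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming its last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(1550111, 1551889)
print(length, protein_length(length))  # 1779 592
```

Both values agree with the Gene Length and Protein Length fields of this record.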


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.658723
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGCTC AAGGTAGAAT TGTTTGGGTA AGTGGACCTG CAGTTAGAGC AGACGGTATG 
TCTGAAGCAA AAATGTATGA AACTGTAACT GTCGGCGATT CAAAATTAGT CGGTGAAGTA
ATTAGATTAA CCGGAGATGT AGCATTCATT CAAGTTTACG AATCAACCAG TGGACTAAAA
CCAGGTGAAC CAGTAATTGG TACTGGAAAC CCACTTAGTG TTTTGTTAGG TCCTGGAATT
ATTGGACAAC TTTATGATGG AATTCAAAGA CCACTAAGAG CATTATCAGA AGCTTCTGGT
TCCTTTATCG GAAGAGGTAT TACAACTACT CCAGTTGACA TGGCCAAAAA ATATCACTTT
GTCCCATCAG TTAGTAATGG TGATGAAGTA GCAGCAGGAA ATGTAATTGG CGTTGTTCAA
GAAACTGATC TTATTGAGCA CTCTATCATG GTTCCACCAG ACCACAAAGG TGGAAAAATT
TCAAACTTAG TATCAGAAGG AGATTATGAT TTAGAAACTG TATTAGCAAC AACTGAGGGA
GAAGGTGAAA CTGTCGAACT CAAAATGTAT CACAGATGGC CTGTAAGAAA ACCACGTCCA
TACAAGAACC GATACGATCC AACAGTTCCA CTACTCACCG GACAACGTGT AATTGACACA
TTCTTCCCAA TTGCAAAAGG AGGAACAGGT TCAATCCCAG GTGCATTTGG AACAGGAAAG
ACTGTTACAC TTCACCAAAT TGCAAAATGG GCAGATTCCC AAGTTGTAGT TTACATCGGT
TGTGGTGAGA GAGGAAACGA AATGACAGAA GTACTTGTGG AGTTCCCACA CCTCAAAGAC
CCACGTAGTG GAAAACCACT CATGGACAGA ACAGTACTTG TTGCAAATAC CAGTAACATG
CCGGTAGCAG CAAGAGAAGC AAGTATCTAC ACTGGTGTCA CAATTGCAGA ATATTACAGA
GATATGGGTA AAGACGTTGT ACTTGTAGCA GATTCAACAA GTAGATGGGC CGAAGCACTC
AGAGAGATGA GTGGTAGACT AGAAGAGATG CCAGCAGAAG AAGGCTATCC ATCATATCTT
GCATCAAGAT TAGCAGAATT CTATGAAAGA GCAGGTCGTG TTAGAGCAGC AGGAAGTCCA
GACCGTGATG GTTCTGTAAC TTTGATTGGT GCTGTTTCAC CATCTGGTGG TGACTTTACA
GAACCAGTTA CAACTCACAC CATGAGATTT ATCAAAACAT TCTGGGCTTT GGATGCAAAA
CTAGCATACT CTAGACACTA TCCATCAATT AACTGGATGA ACAGCTATTC TGGTTATCTT
GCAGACATTG CAAAGTGGTG GGGTGAGAAC ATCAACGAAG ACTGGCTAAG TCTTAGAAGT
GAAGTTTATG GTGTCTTACA AAGAGAAGAT ACACTAAAAG AAATTGTCAG ACTCTTAGGA
CCTGAAGCAC TTCCAGATGA AGAAAAATTA ATTCTTGAAG TTGCAAGAAT GGTAAAGATT
GGTCTCTTAC AACAAAACTC ATTTGATGAT GTTGACACTT ATTGTAGCCC AGAGAAACAA
TACAAGTTAA TGAAATTACT AGTTGACTTT TACAAGAAAG GTCAACAAGC AATCAAGGAA
GGAACTCCTC TTGCAGATAT TCGTGCAATG AAAAGTATCA CAACACTTCT CAAAGCAAGA
ATGGATGTCA AAGATGATGA GATGCCAAAA CTTGATCAAC TAGATGCAGA CATGCAAGAA
GAATTCAAAT CAATTACAGG AGTGAAAGTA TCAAATTGA
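
The gene sequence above is displayed in blocks of 10 bases separated by spaces, so it needs to be joined before any computation. As a rough sketch, the GC content field could be recomputed as the fraction of G and C bases in the cleaned sequence. The helper names `clean_sequence` and `gc_content` are hypothetical, and only the first display line is used here for brevity.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First display line of the gene sequence; paste all lines for the full gene.
first_line = "ATGGCAGCTC AAGGTAGAAT TGTTTGGGTA AGTGGACCTG CAGTTAGAGC AGACGGTATG"
seq = clean_sequence(first_line)
print(len(seq), f"{100 * gc_content(seq):.0f}%")
```

Run on the complete 1779 bp sequence, the same function gives a value that can be compared with the GC content field of the record.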
 
Protein sequence
MAAQGRIVWV SGPAVRADGM SEAKMYETVT VGDSKLVGEV IRLTGDVAFI QVYESTSGLK 
PGEPVIGTGN PLSVLLGPGI IGQLYDGIQR PLRALSEASG SFIGRGITTT PVDMAKKYHF
VPSVSNGDEV AAGNVIGVVQ ETDLIEHSIM VPPDHKGGKI SNLVSEGDYD LETVLATTEG
EGETVELKMY HRWPVRKPRP YKNRYDPTVP LLTGQRVIDT FFPIAKGGTG SIPGAFGTGK
TVTLHQIAKW ADSQVVVYIG CGERGNEMTE VLVEFPHLKD PRSGKPLMDR TVLVANTSNM
PVAAREASIY TGVTIAEYYR DMGKDVVLVA DSTSRWAEAL REMSGRLEEM PAEEGYPSYL
ASRLAEFYER AGRVRAAGSP DRDGSVTLIG AVSPSGGDFT EPVTTHTMRF IKTFWALDAK
LAYSRHYPSI NWMNSYSGYL ADIAKWWGEN INEDWLSLRS EVYGVLQRED TLKEIVRLLG
PEALPDEEKL ILEVARMVKI GLLQQNSFDD VDTYCSPEKQ YKLMKLLVDF YKKGQQAIKE
GTPLADIRAM KSITTLLKAR MDVKDDEMPK LDQLDADMQE EFKSITGVKV SN
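
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a minimal translator, not the tool used to generate this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as start codons, which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons in TCAG order, paired with the standard amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop display spaces and newlines
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final nine bases of the gene sequence above.
print(translate("ATGGCAGCTC AAGGTAGAAT TGTTTGGGTA"))  # MAAQGRIVWV
print(translate("TCAAATTGA"))  # SN
```

The two outputs match the first ten and last two residues of the protein sequence listed above.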