Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_1691 |
Symbol | |
ID | 5774519 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 1550111 |
End bp | 1551889 |
Gene Length | 1779 bp |
Protein Length | 592 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 641317345 |
Product | H+transporting two-sector ATPase alpha/beta subunit central region |
Protein accession | YP_001583025 |
Protein GI | 161529199 |
COG category | [C] Energy production and conversion |
COG ID | [COG1155] Archaeal/vacuolar-type H+-ATPase subunit A |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.658723 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGCTC AAGGTAGAAT TGTTTGGGTA AGTGGACCTG CAGTTAGAGC AGACGGTATG TCTGAAGCAA AAATGTATGA AACTGTAACT GTCGGCGATT CAAAATTAGT CGGTGAAGTA ATTAGATTAA CCGGAGATGT AGCATTCATT CAAGTTTACG AATCAACCAG TGGACTAAAA CCAGGTGAAC CAGTAATTGG TACTGGAAAC CCACTTAGTG TTTTGTTAGG TCCTGGAATT ATTGGACAAC TTTATGATGG AATTCAAAGA CCACTAAGAG CATTATCAGA AGCTTCTGGT TCCTTTATCG GAAGAGGTAT TACAACTACT CCAGTTGACA TGGCCAAAAA ATATCACTTT GTCCCATCAG TTAGTAATGG TGATGAAGTA GCAGCAGGAA ATGTAATTGG CGTTGTTCAA GAAACTGATC TTATTGAGCA CTCTATCATG GTTCCACCAG ACCACAAAGG TGGAAAAATT TCAAACTTAG TATCAGAAGG AGATTATGAT TTAGAAACTG TATTAGCAAC AACTGAGGGA GAAGGTGAAA CTGTCGAACT CAAAATGTAT CACAGATGGC CTGTAAGAAA ACCACGTCCA TACAAGAACC GATACGATCC AACAGTTCCA CTACTCACCG GACAACGTGT AATTGACACA TTCTTCCCAA TTGCAAAAGG AGGAACAGGT TCAATCCCAG GTGCATTTGG AACAGGAAAG ACTGTTACAC TTCACCAAAT TGCAAAATGG GCAGATTCCC AAGTTGTAGT TTACATCGGT TGTGGTGAGA GAGGAAACGA AATGACAGAA GTACTTGTGG AGTTCCCACA CCTCAAAGAC CCACGTAGTG GAAAACCACT CATGGACAGA ACAGTACTTG TTGCAAATAC CAGTAACATG CCGGTAGCAG CAAGAGAAGC AAGTATCTAC ACTGGTGTCA CAATTGCAGA ATATTACAGA GATATGGGTA AAGACGTTGT ACTTGTAGCA GATTCAACAA GTAGATGGGC CGAAGCACTC AGAGAGATGA GTGGTAGACT AGAAGAGATG CCAGCAGAAG AAGGCTATCC ATCATATCTT GCATCAAGAT TAGCAGAATT CTATGAAAGA GCAGGTCGTG TTAGAGCAGC AGGAAGTCCA GACCGTGATG GTTCTGTAAC TTTGATTGGT GCTGTTTCAC CATCTGGTGG TGACTTTACA GAACCAGTTA CAACTCACAC CATGAGATTT ATCAAAACAT TCTGGGCTTT GGATGCAAAA CTAGCATACT CTAGACACTA TCCATCAATT AACTGGATGA ACAGCTATTC TGGTTATCTT GCAGACATTG CAAAGTGGTG GGGTGAGAAC ATCAACGAAG ACTGGCTAAG TCTTAGAAGT GAAGTTTATG GTGTCTTACA AAGAGAAGAT ACACTAAAAG AAATTGTCAG ACTCTTAGGA CCTGAAGCAC TTCCAGATGA AGAAAAATTA ATTCTTGAAG TTGCAAGAAT GGTAAAGATT GGTCTCTTAC AACAAAACTC ATTTGATGAT GTTGACACTT ATTGTAGCCC AGAGAAACAA TACAAGTTAA TGAAATTACT AGTTGACTTT TACAAGAAAG GTCAACAAGC AATCAAGGAA GGAACTCCTC TTGCAGATAT TCGTGCAATG AAAAGTATCA CAACACTTCT CAAAGCAAGA ATGGATGTCA AAGATGATGA GATGCCAAAA CTTGATCAAC TAGATGCAGA CATGCAAGAA GAATTCAAAT CAATTACAGG AGTGAAAGTA TCAAATTGA
|
Protein sequence | MAAQGRIVWV SGPAVRADGM SEAKMYETVT VGDSKLVGEV IRLTGDVAFI QVYESTSGLK PGEPVIGTGN PLSVLLGPGI IGQLYDGIQR PLRALSEASG SFIGRGITTT PVDMAKKYHF VPSVSNGDEV AAGNVIGVVQ ETDLIEHSIM VPPDHKGGKI SNLVSEGDYD LETVLATTEG EGETVELKMY HRWPVRKPRP YKNRYDPTVP LLTGQRVIDT FFPIAKGGTG SIPGAFGTGK TVTLHQIAKW ADSQVVVYIG CGERGNEMTE VLVEFPHLKD PRSGKPLMDR TVLVANTSNM PVAAREASIY TGVTIAEYYR DMGKDVVLVA DSTSRWAEAL REMSGRLEEM PAEEGYPSYL ASRLAEFYER AGRVRAAGSP DRDGSVTLIG AVSPSGGDFT EPVTTHTMRF IKTFWALDAK LAYSRHYPSI NWMNSYSGYL ADIAKWWGEN INEDWLSLRS EVYGVLQRED TLKEIVRLLG PEALPDEEKL ILEVARMVKI GLLQQNSFDD VDTYCSPEKQ YKLMKLLVDF YKKGQQAIKE GTPLADIRAM KSITTLLKAR MDVKDDEMPK LDQLDADMQE EFKSITGVKV SN
|
| |