Gene Nmar_1340 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1340 
Symbol 
ID5774208 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp1227903 
End bp1228823 
Gene Length921 bp 
Protein Length306 aa 
Translation table11 
GC content41% 
IMG OID641316985 
Producttranscription factor TFIIB cyclin-related protein 
Protein accessionYP_001582674 
Protein GI161528848 
COG category[K] Transcription 
COG ID[COG1405] Transcription initiation factor TFIIIB, Brf1 subunit/Transcription initiation factor TFIIB 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.000744308 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCGTCGG GAAGGTTTTT TGATCATATC ATGAGTTCAC AAGAAATCAA ATGTCCTAGA 
TGTGGAAAAA ACGCGCTAGT CACTGATGTT GAATCGTCAG AAATATTTTG TTCAAATTGC
GGTATTGTGG TTGAAGAGAA AACTAATGAC CGCAGACCCG AAAGAGCATT TGCAAATTCA
ACTACCGATA AATCCCATAC TGGCGACAAG ACATCTTTGA CAAGACATGA CAGGGGACTT
AGCACTGTGA TTAACCCCTT TGACAAAGAT TCTGCTGGAA GTCCCTTGTC TGCGTCAATG
AAATCATCCA TGACACGACT CCGGAAATGG GATAATCGAA GTCGCATAAA GACCAATGAT
GACAGAAATT TGCAACAAGC ACTGCTGGAA TTGTCAAAAA TGAAAGAAAA ATTGTCCTTG
TCTGACGCAA TTGCTGAAAA AGCATCGTAC ATCTATAGAA AGGCACTGGA GAAAAAACTG
GTCAAGGGGC GTTCTATTGC ATCTCTTGTT GCAGCTTGTC TTTATGCCGC ATGCCGTGAA
TCAGAAGCTC CCAGGACACT CCGAGAAGTT GCAGCATCCA TAGGAATTAA ACGCAAAGAA
ATCTCTGCAA CATACCGGCT CATATTCAAA GAGTTAGACC TTAAAATGCC CGTAATTGAC
TCTGTCTCCT GTATTGCAAA AATTGCAAGC AATGCAGAAC TGTCTGAGAA AACAAAAAGA
TATGCCATAA AAATTCTAAA AAAGGCAGAA AAACAAAACA TGTCTGCAGG GAAGCATCCT
ATGGGACTGG CTGCCTCTGC ATTATACCTG GCGTCAATAG ATTTGGAGGA ATTTAGGACT
CAAAAAGAAA TTGCAGATGC AGCGGGAATC ACAGAGGTTA CTGTCAGAAA CAGATGCAAA
GGTCTCAAAC AAATGATCTA A
 
Protein sequence
MSSGRFFDHI MSSQEIKCPR CGKNALVTDV ESSEIFCSNC GIVVEEKTND RRPERAFANS 
TTDKSHTGDK TSLTRHDRGL STVINPFDKD SAGSPLSASM KSSMTRLRKW DNRSRIKTND
DRNLQQALLE LSKMKEKLSL SDAIAEKASY IYRKALEKKL VKGRSIASLV AACLYAACRE
SEAPRTLREV AASIGIKRKE ISATYRLIFK ELDLKMPVID SVSCIAKIAS NAELSEKTKR
YAIKILKKAE KQNMSAGKHP MGLAASALYL ASIDLEEFRT QKEIADAAGI TEVTVRNRCK
GLKQMI