Gene Nmar_1144 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1144 
Symbol 
ID5774510 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp1046995 
End bp1047993 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content38% 
IMG OID641316787 
Productcytochrome c biogenesis protein transmembrane region 
Protein accessionYP_001582478 
Protein GI161528652 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0785] Cytochrome c biogenesis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.0421385 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAACGG TGGTAATTGC AAAAAAATCA ATGGTAATAA TTTCATTTGC ATTATTTTCA 
TTAATCTTTC TTGGAATAAT TTTTTCGCTT GGAACAAATT TTACAATAGA AGGTAAAGAA
CACACAACAT ACCTCTCATG GATAGTAATT GCATATGTTG CAGGATTGTC CATGATTGTT
CTTCCATGTA CACTGCCACT GGTATTCATC ATAGTTCCAC TAAGTATGGG ACAAGGGTAC
AAGAAAGGAT TAGCCATGGC ATTACTTTTT GGTCTAGGAC TAGTCATTAC AATTGCATCT
TATGGAATTG CAATTGCAGG AATTGGGCAA AGTGCATCAC TAGACCAAGC ATCAACTATC
ATGTTCTTAA TTGCAGGAAT TGCGGCATTT GTCTTTGGAT TATCACAACT AAAAATTATT
TCATTAAAGC TACCATCGTA TTCAGGAACT CCAAAGTTTA TCCAGAACAG AGGAGAATAT
ACAAAATCAT TTTTCATGGG ATTACTATTA GGAAATGCAG GAGTTGGATG CCCCAATCCG
TTGTTTTACT GGCTACTAAT CTACATTGCA GGAACAGGCA GTATCGAAGT CGGAGCTTCA
TTAGGAGTAG TTCACGGAGT TGGAAGGGCA ATTCCCCTAA TTTTGATGTC AGTTCTTGCA
GTAATTGGAA TCAATGCAAC AAAGAGTTTG ACTCTAAAAC GAGAATCAAT TGAGCGAGCA
TCAGGATGGA TGCTAATAGT GATTGGGGCA TTTTTGATAA TCAACGGACT GCCAGAGGGA
CACGAATGGT ACGAAGAACT ATTCATCCAT CAAGGATGGA ATCAACTCGT TGAGATGACA
GGAATACCAG CAGAATTTGA GATGGACGAA CATACACATG ACCACGGACA TGTAGAAGGA
AGAGATTTCA AAGTATTTTA CACAGCTTTG TTAGCGGTAT TGGTATTGAG TCCGTTGTTC
ATACGTTCAG TTAGAAAAAT CAGGGAGGTG AATGCATGA
 
Protein sequence
MSTVVIAKKS MVIISFALFS LIFLGIIFSL GTNFTIEGKE HTTYLSWIVI AYVAGLSMIV 
LPCTLPLVFI IVPLSMGQGY KKGLAMALLF GLGLVITIAS YGIAIAGIGQ SASLDQASTI
MFLIAGIAAF VFGLSQLKII SLKLPSYSGT PKFIQNRGEY TKSFFMGLLL GNAGVGCPNP
LFYWLLIYIA GTGSIEVGAS LGVVHGVGRA IPLILMSVLA VIGINATKSL TLKRESIERA
SGWMLIVIGA FLIINGLPEG HEWYEELFIH QGWNQLVEMT GIPAEFEMDE HTHDHGHVEG
RDFKVFYTAL LAVLVLSPLF IRSVRKIREV NA