Gene RoseRS_1198 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_1198 
Symbol 
ID5208150 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp1468054 
End bp1469481 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content59% 
IMG OID640594816 
Productnitrogenase 
Protein accessionYP_001275555 
Protein GI148655350 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.403437 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAGCT GTCTGACCCT TCAGGAACGA GCCGTTGCCA TCAACCCGAC CCGTTCCTGC 
GCGCCAATCG GGGCAATGCT CGCCAATTAC GGCATTCACG GCGCTATCAC CATCAACCAC
GGCTCCCAGG GATGCGCCAC CTACCCGCGT CACCAGATGT CACGTCACTT CCGCGAGCCG
GTTGAGGTCG CCACCACCTC GCTCACCGAA AAGACAACGG TCTACGGCGG CAAGCAGAAC
CTGCTCGCGG CGCTCAAGAA TATCTGGGAA CGCTTCCACC CGACCATGAT CATGGTCTGT
TCGACCTGTC TCTCAGAAAC GATCGGGGAC GACATTCCTG CGATCATCGA TGAGTTTCTG
GACAAACATC CGGACGTCAC AATCCCGATC CTTTCGGTCA AAACTCCGTC GTACATCGGC
AATCACACGA CCGGCTTCGA CAACTTTCTC AAAGAGATCG CCCTCAATCT CCCGGATCGC
CGCAAGAAGA AAGGCGAGAC CAACGGCAGG ATCAACATCA TTCCCGGCTG GGTCAACCCC
GGCGACATCC GCGAACTGAA GCATATGCTG CGGGAGATGG GGTTGCACGG GCTGTGGATC
ACCGACTACT CAGAGACCCT CGACGGCGGC TACTACGACC CGCGTCCCCA TGTTCCGCGC
GGCGGCACGA CCATCGAGGA ACTGCGCAGC TCCTCGAAAT CGCTGGCAAC GATTGCGCTC
CAGCGCCACG TTGGCGGCGA AGCGGCGCGC ATCTACGAAC GACGCTACAA CGTGCCCGCC
CATGTGTTGA CCATGCCCAT CGGGCTGAAG AATACCGATG CTTTCGTCAA CACGCTGATC
GAGATCACCG ACCATACGAT CCCCGAATCG CTGGAAGTCG AGCGGGCACG CCTGCTCGAT
GCACTGGTCG ATACGCATAT GTACACTACC GGACTGCGGG TCGCACTCTA CGGCGATCCC
GACCTGCTCG AAGGGTTGGT CGGGCTGATC GCCGAAATGG GTATGACCCC GGCATACATT
CTGACCGCTG CCGACAATCG TCCCTGGGGC GAACGAATGG TCGAACTGAC GGGAGAACTG
GGGGTTGAGA GCGAGATCAT TCTCAAGGGT GATCTGCACG AATTGCACAA GCGTATCAAG
CAGCAACCGG TTGATCTGCT CATCGGGCAC TCGAAAGGCA GGTTCATCGC CGAAGCCGAG
AACATTCCGC TGGTGCGGGT TGGATTCCCG GTTGAAGACC GCTTCGGACA TCATCGACGA
TCGATTGTCG GTTACAACGG TGCGATTGCG CTGGTCGATG AGATCACCAA CACGATCTTT
GAGCGCCGCG CAACGACCAT CGTGAGCAAC ACCCTGATCG AAACCGGCGT CGAGGGACCG
ACTTCGGTTC CAATTGCGCT GCGCAACGGC ACGACGGCGC ACGGATAG
 
Protein sequence
MTSCLTLQER AVAINPTRSC APIGAMLANY GIHGAITINH GSQGCATYPR HQMSRHFREP 
VEVATTSLTE KTTVYGGKQN LLAALKNIWE RFHPTMIMVC STCLSETIGD DIPAIIDEFL
DKHPDVTIPI LSVKTPSYIG NHTTGFDNFL KEIALNLPDR RKKKGETNGR INIIPGWVNP
GDIRELKHML REMGLHGLWI TDYSETLDGG YYDPRPHVPR GGTTIEELRS SSKSLATIAL
QRHVGGEAAR IYERRYNVPA HVLTMPIGLK NTDAFVNTLI EITDHTIPES LEVERARLLD
ALVDTHMYTT GLRVALYGDP DLLEGLVGLI AEMGMTPAYI LTAADNRPWG ERMVELTGEL
GVESEIILKG DLHELHKRIK QQPVDLLIGH SKGRFIAEAE NIPLVRVGFP VEDRFGHHRR
SIVGYNGAIA LVDEITNTIF ERRATTIVSN TLIETGVEGP TSVPIALRNG TTAHG