Gene RoseRS_1199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_1199 
Symbol 
ID5208151 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp1469478 
End bp1470908 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content59% 
IMG OID640594817 
Productnitrogenase component I, alpha chain 
Protein accessionYP_001275556 
Protein GI148655351 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01282] nitrogenase molybdenum-iron protein alpha chain
[TIGR01284] nitrogenase alpha chain
[TIGR01862] nitrogenase component I, alpha chain 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGTTCA AATGCAATCA GACCCTGCCT GAGCGAGCGA TCCATATCGC GCTCAAGGGA 
CCGGGCGGGA AGTGTCAGCG CGGCGATGGC ACCACCTGTT TCATTGCCAA CAACGTGGCA
ACGACGCCTG GCGATATGAC CGAGCGCGGC TGCACCTACG CCGGCTGTCG CGGCGTCGTC
GGCGGACCGG TCAAGGACGC TATTCAACTG ACCCACGGAC CGATCGGGTG CGCGTTCTTC
TCCTGGGGCT ACCGTCCGCA CCTCGCCGAC AGCGATTTTC ACATGAAGTA CACCTTCGTC
ACCGATATGA ACGAAACCAA CATCGTCTTC GGCGGCGAGA AAAAGCTGCT TCAGTCGATC
ATCGAAGCCA ATGCCGAGTT TCCCAATGCG AAGGCGGTGT TCGTCTACAA CACCTGCTCT
ACGGCGCTGA TCGGCGATGA CGGGCGCGAC GTCGCCAAAC AGGCGGAAGC GATCATCGGC
AAACCGGTGG TGTTCTTCGA GTGCGAGGGG TTTCGTGGCG TCAGTCAGTC GATGGGGCAC
CACGTCGGCA ACGAGACGAT CTTTCGTCAA CTGGTCGGCT CGGTCGAACC GGAGGGCGAT
TTCAGCCGCT CGATCAACAT CATCGGCGAC TACAACATCA AGAATGACAT CCGCACCTTC
GAGTATCTCT TCGAGGCGCT TGGCTTGCGG ATCATCGCCC GCTTCACCGG GAACGTCTCA
GTGGACGACC TGAAGATCAT GCACAAAGCG GCGCTCAATA TCGTGCACTG CCAGCGATCC
GCCACCTACA TCGCCGACAT GATGAAGGAT AAGTATGGCA CGCCGTACAT CAATGTCACG
CTCTGGGGCA TGAAGAACAT GGCAAAAGCG CTGCGCGACA CCGCCGCGTT CTTCGGGCTT
GAAGCGCGCG CCGAAGAAGT GATCGCCCGA GAAGTGGCGC GCATTCAACC CTACATCGAC
GCCTATCGTC AACGCCTGCA GGGGAAGCGC GTCTTCATCT ACCAGGGCGG TCCGCGCGTC
TGGCACTGGA TCGAACTCCT GCGCGAATTG GGCATGGAGA CCGAGACGGC AGCCACAACC
TTCGGGCATA CCGATGATTA CGAGAAGATA TTCAACCAGA TCGGCGAAGG CGCGCTGGTT
ATCGACAACC CGAATGTCCC CGAAATCGAA GAAATCCTGA CCCGTCGCCG TCCCGACCTG
TTCATCTCAG GCAACAAGGA GCGATACCTG GCATACAAAA TGGGCGTGCC GTTCGTCAAT
GGGCATACCT ACGACACCGG ACCCTACGCC GGTTTTGTGG GCATGGTCAA CTTCGCGCGC
GATATCGATA AAGCCCTGCA TGCGCCGGTC TGGAATATCG TGCATCAGCA CGCCCGACCT
GCACCCGTCG CCCGCCACGC AGTCCACGGA TCTGAGGAGG TGGAGTCATG A
 
Protein sequence
MQFKCNQTLP ERAIHIALKG PGGKCQRGDG TTCFIANNVA TTPGDMTERG CTYAGCRGVV 
GGPVKDAIQL THGPIGCAFF SWGYRPHLAD SDFHMKYTFV TDMNETNIVF GGEKKLLQSI
IEANAEFPNA KAVFVYNTCS TALIGDDGRD VAKQAEAIIG KPVVFFECEG FRGVSQSMGH
HVGNETIFRQ LVGSVEPEGD FSRSINIIGD YNIKNDIRTF EYLFEALGLR IIARFTGNVS
VDDLKIMHKA ALNIVHCQRS ATYIADMMKD KYGTPYINVT LWGMKNMAKA LRDTAAFFGL
EARAEEVIAR EVARIQPYID AYRQRLQGKR VFIYQGGPRV WHWIELLREL GMETETAATT
FGHTDDYEKI FNQIGEGALV IDNPNVPEIE EILTRRRPDL FISGNKERYL AYKMGVPFVN
GHTYDTGPYA GFVGMVNFAR DIDKALHAPV WNIVHQHARP APVARHAVHG SEEVES