Gene Rcas_4041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4041 
Symbol 
ID5541552 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5241749 
End bp5243179 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content57% 
IMG OID640896154 
Productnitrogenase component I, alpha chain 
Protein accessionYP_001434092 
Protein GI156743963 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01284] nitrogenase alpha chain
[TIGR01862] nitrogenase component I, alpha chain 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.00949178 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCAGTTCA AGTGCAATCA GACCCTGCCT GAGCGAGCGA TCCATATCGC GCTCAAAGGA 
CCGGACGGGA AGTGCCAGCG CGGTGATGGA ACCGGCTGCT TCATCGCCAA CAATGTTGCA
ACCACCCCCG GTGATATGAC CGAGCGTGGT TGCACCTACG CCGGCTGTCG CGGCGTCGTC
GGCGGGCCGG TAAAGGATGC TATTCAACTG ACGCACGGAC CGATCGGGTG CGCATTCTTT
TCGTGGGGCT ACCGTCCGCA CCTCGCCGAC AGCGATTTTC ACATGAAATA CACCTTCGTC
TCCGACATGA ACGAAACAAA CATCGTCTTC GGCGGCGAGA AGAAATTGCT GCAATCGATC
ATCGAAGCCA GCGCCGAATT TCCCGACGCA AAGGCGGTGT TTGTCTACAA CACCTGCTCC
ACGGCACTGA TCGGCGACGA CGGGCGTGAT GTCGCCAAAC AAGCGGAAGC GATCATCGGC
AAGCCAGTCG TGTTCTTCGA GTGCGAGGGG TTTCGCGGTG TCAGCCAGTC GATGGGACAC
CACGTTGGCA ACGAAACGAT CTTTCGCCAA CTGGTCGGTT CGATCGAGCC GGAGGGTGAT
TTCAGCCGTT CGATCAATAT CATCGGCGAC TACAACATCA AGAATGACAT CCGCACCTTC
GAGTATCTCT TCGAGGCGCT CGGCTTGCAG ATCATCGCTC GTTTTACCGG GAATGTCTCG
GTGGATGACC TGAAGATCAT GCACAAGGCG GCGCTCAACA TCGTGCATTG CCAGCGTTCA
GCCACGTACA TCGCCGATAT GATGAAGGAG AAGTATGGCA CACCGTCTAT CAACGTCACC
CTTTGGGGCA TCAGGAATAT GGCGCAGGCG TTGCGCGCCG CCGCCGCATT CTTTGGGCTT
GAAACGCGCG CCGAAGAGGT GATTGCCCAC GAAGTCACCC GCATTCAACC CTATATCGAC
GCATACCGCC AGCGATTGCA TGGAAAGCGC GTCTTCATCT ATCAGGGAGG CCCGCGCGTC
TGGCACTGGA TCGAACTCCT GCGCGAATTG GGCATGGAGA CCGAAACGGC AGCCACAACC
TTCGGGCATA CTGACGACTA CGAGAAGATT TTCAATCAGA TCCCAGAAGG CGCGCTGGTG
ATCGACAACC CCAACGTTCC CGAAATCGAA GAAATTCTGA ACCGACGTCG CCCCGACCTG
TTCATCTCGG GCAACAAGGA GCGGTACCTG GCGTATAAAC TCGGCGTGCC ATTCGTCAAT
GGGCATACTT ACGATACCGG ACCCTATGCC GGCTTCGTAG GCATGGTCAA CTTTGCGCGC
GACATCGATA AAGCGCTGCA TGCGCCGGTC TGGAACATCC TGCATCAGCG CGCCCGCCCC
GCGCCCGCTA CGCATCACGC AGCGCACGGT TTTGAGGAGG TGGAGTCATG A
 
Protein sequence
MQFKCNQTLP ERAIHIALKG PDGKCQRGDG TGCFIANNVA TTPGDMTERG CTYAGCRGVV 
GGPVKDAIQL THGPIGCAFF SWGYRPHLAD SDFHMKYTFV SDMNETNIVF GGEKKLLQSI
IEASAEFPDA KAVFVYNTCS TALIGDDGRD VAKQAEAIIG KPVVFFECEG FRGVSQSMGH
HVGNETIFRQ LVGSIEPEGD FSRSINIIGD YNIKNDIRTF EYLFEALGLQ IIARFTGNVS
VDDLKIMHKA ALNIVHCQRS ATYIADMMKE KYGTPSINVT LWGIRNMAQA LRAAAAFFGL
ETRAEEVIAH EVTRIQPYID AYRQRLHGKR VFIYQGGPRV WHWIELLREL GMETETAATT
FGHTDDYEKI FNQIPEGALV IDNPNVPEIE EILNRRRPDL FISGNKERYL AYKLGVPFVN
GHTYDTGPYA GFVGMVNFAR DIDKALHAPV WNILHQRARP APATHHAAHG FEEVES