Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_4041 |
Symbol | |
ID | 5541552 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 5241749 |
End bp | 5243179 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640896154 |
Product | nitrogenase component I, alpha chain |
Protein accession | YP_001434092 |
Protein GI | 156743963 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01284] nitrogenase alpha chain [TIGR01862] nitrogenase component I, alpha chain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.00949178 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAGTTCA AGTGCAATCA GACCCTGCCT GAGCGAGCGA TCCATATCGC GCTCAAAGGA CCGGACGGGA AGTGCCAGCG CGGTGATGGA ACCGGCTGCT TCATCGCCAA CAATGTTGCA ACCACCCCCG GTGATATGAC CGAGCGTGGT TGCACCTACG CCGGCTGTCG CGGCGTCGTC GGCGGGCCGG TAAAGGATGC TATTCAACTG ACGCACGGAC CGATCGGGTG CGCATTCTTT TCGTGGGGCT ACCGTCCGCA CCTCGCCGAC AGCGATTTTC ACATGAAATA CACCTTCGTC TCCGACATGA ACGAAACAAA CATCGTCTTC GGCGGCGAGA AGAAATTGCT GCAATCGATC ATCGAAGCCA GCGCCGAATT TCCCGACGCA AAGGCGGTGT TTGTCTACAA CACCTGCTCC ACGGCACTGA TCGGCGACGA CGGGCGTGAT GTCGCCAAAC AAGCGGAAGC GATCATCGGC AAGCCAGTCG TGTTCTTCGA GTGCGAGGGG TTTCGCGGTG TCAGCCAGTC GATGGGACAC CACGTTGGCA ACGAAACGAT CTTTCGCCAA CTGGTCGGTT CGATCGAGCC GGAGGGTGAT TTCAGCCGTT CGATCAATAT CATCGGCGAC TACAACATCA AGAATGACAT CCGCACCTTC GAGTATCTCT TCGAGGCGCT CGGCTTGCAG ATCATCGCTC GTTTTACCGG GAATGTCTCG GTGGATGACC TGAAGATCAT GCACAAGGCG GCGCTCAACA TCGTGCATTG CCAGCGTTCA GCCACGTACA TCGCCGATAT GATGAAGGAG AAGTATGGCA CACCGTCTAT CAACGTCACC CTTTGGGGCA TCAGGAATAT GGCGCAGGCG TTGCGCGCCG CCGCCGCATT CTTTGGGCTT GAAACGCGCG CCGAAGAGGT GATTGCCCAC GAAGTCACCC GCATTCAACC CTATATCGAC GCATACCGCC AGCGATTGCA TGGAAAGCGC GTCTTCATCT ATCAGGGAGG CCCGCGCGTC TGGCACTGGA TCGAACTCCT GCGCGAATTG GGCATGGAGA CCGAAACGGC AGCCACAACC TTCGGGCATA CTGACGACTA CGAGAAGATT TTCAATCAGA TCCCAGAAGG CGCGCTGGTG ATCGACAACC CCAACGTTCC CGAAATCGAA GAAATTCTGA ACCGACGTCG CCCCGACCTG TTCATCTCGG GCAACAAGGA GCGGTACCTG GCGTATAAAC TCGGCGTGCC ATTCGTCAAT GGGCATACTT ACGATACCGG ACCCTATGCC GGCTTCGTAG GCATGGTCAA CTTTGCGCGC GACATCGATA AAGCGCTGCA TGCGCCGGTC TGGAACATCC TGCATCAGCG CGCCCGCCCC GCGCCCGCTA CGCATCACGC AGCGCACGGT TTTGAGGAGG TGGAGTCATG A
|
Protein sequence | MQFKCNQTLP ERAIHIALKG PDGKCQRGDG TGCFIANNVA TTPGDMTERG CTYAGCRGVV GGPVKDAIQL THGPIGCAFF SWGYRPHLAD SDFHMKYTFV SDMNETNIVF GGEKKLLQSI IEASAEFPDA KAVFVYNTCS TALIGDDGRD VAKQAEAIIG KPVVFFECEG FRGVSQSMGH HVGNETIFRQ LVGSIEPEGD FSRSINIIGD YNIKNDIRTF EYLFEALGLQ IIARFTGNVS VDDLKIMHKA ALNIVHCQRS ATYIADMMKE KYGTPSINVT LWGIRNMAQA LRAAAAFFGL ETRAEEVIAH EVTRIQPYID AYRQRLHGKR VFIYQGGPRV WHWIELLREL GMETETAATT FGHTDDYEKI FNQIPEGALV IDNPNVPEIE EILNRRRPDL FISGNKERYL AYKLGVPFVN GHTYDTGPYA GFVGMVNFAR DIDKALHAPV WNILHQRARP APATHHAAHG FEEVES
|
| |