Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_4040 |
Symbol | |
ID | 5541551 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 5240325 |
End bp | 5241752 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640896153 |
Product | nitrogenase |
Protein accession | YP_001434091 |
Protein GI | 156743962 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.342277 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.00949178 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCAGTT GTCTCACCAT GCAGGATCGC GCTGTCGCCA TTAACCCGAC CCGTTCCTGC GCGCCAATCG GGGCAATGCT CGCCAATTAC GGCATTCACG GCGCCATTAC CATCAACCAC GGCTCACAGG GATGCGCCAC CTACCCGCGA CATCAGATGG CGCGCCACTT CCGCGAACCG GTCGAAGTCG CCACCACCTC GCTCACCGAA AAGACGACGG TCTATGGCGG CAAACAAAAC CTGCTCGCGG CGCTCAAGAA TATCTGGGAA CGATTCCATC CGACGATGAT TATGGTCTGT TCGACCTGTC TCTCCGAGAC GATCGGCGAC GACATTCCCG GAATCATCGA CGAGTTTCTG GACAAGCGCC CGGAGGTCAC CATCCCGATC CTGTCGGTCA AAACCCCCTC GTACATCGGC AACCACACGA CTGGCTTCGA CAATTTTCTC AAGGAGATCG CGCTCAATCT GCCGGATCGC CGCAAGAAGA AAGGCGAGAC CAACGGCAAG ATCAACATTA TTCCCGGCTG GGTCAATCCC GGCGACATCC GCGAACTAAA GCACATGCTG CGTGAAATGG GGCTGCACGG GTTGTGGATC ACCGATTACT CGGAAACCCT CGACGGCGGC TACTACCATC CGCGCCCCCA CTTCCCGCGC GGAGGCACGA CTGTTGAGGA ACTGCGCAAC TCCTCGAAGT CGCTGGCGAC TATCGCGCTC CAGCGCCACA TCGGTGGTGA AGCAGCGCAC ATTTATGAGC GACGCTACAA CGTCCCCGCT CACGTGCTGA CTATGCCCAT CGGCTTGAGG AACACCGATG CCTTTGTCAA CACACTGGTT GAGATCACCG ACCACACGAT CCCCGAATCG CTAGGGGTCG AACGGGCGCG CCTGCTCGAT GCACTGGTTG ATACGCATAT GTACACGACA GGATTGCGTG TTGCGCTCTA CGGCGATCCC GATATGCTTG AGGGGCTAGT CGGGCTGATC GCCGAAATGG GCATGATCCC GGCACACATC CTGACCGCCG CCGACAACCG CTCCTGGGGA GAACGGATGG TCGAACTGAC AGAGGAACTG GAGGTCGAGA GCGAGATCAT TCTCAAGGGT GATCTCCACG AACTGCACAA GCGGATCAAG CAACGACCGG TCGATCTGCT GATGGGACAC TCGAAAGGCA AATTTATCGC CGAAGCGGAA AACATCCCGC TGGTGCGAGT TGGTTTCCCG GTCGAAGATC GCTTTGGCTA CCATCGTCGA TCTATCGTTG GCTACAACGG CGCGACTGCA CTGGTCGATG AGATCACAAA TATGATCTTC GAGCGCCGTG CAACGGCGAT TGTGAGCAAC ACCCTGCTCG AAACCGGCCT CGAAAGACCA ACAGACATTC CGATCACGCT ACGCAATGGC GCCGCACACC ATCCGTAG
|
Protein sequence | MTSCLTMQDR AVAINPTRSC APIGAMLANY GIHGAITINH GSQGCATYPR HQMARHFREP VEVATTSLTE KTTVYGGKQN LLAALKNIWE RFHPTMIMVC STCLSETIGD DIPGIIDEFL DKRPEVTIPI LSVKTPSYIG NHTTGFDNFL KEIALNLPDR RKKKGETNGK INIIPGWVNP GDIRELKHML REMGLHGLWI TDYSETLDGG YYHPRPHFPR GGTTVEELRN SSKSLATIAL QRHIGGEAAH IYERRYNVPA HVLTMPIGLR NTDAFVNTLV EITDHTIPES LGVERARLLD ALVDTHMYTT GLRVALYGDP DMLEGLVGLI AEMGMIPAHI LTAADNRSWG ERMVELTEEL EVESEIILKG DLHELHKRIK QRPVDLLMGH SKGKFIAEAE NIPLVRVGFP VEDRFGYHRR SIVGYNGATA LVDEITNMIF ERRATAIVSN TLLETGLERP TDIPITLRNG AAHHP
|
| |