Gene Rcas_4040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4040 
Symbol 
ID5541551 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5240325 
End bp5241752 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content58% 
IMG OID640896153 
Productnitrogenase 
Protein accessionYP_001434091 
Protein GI156743962 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.342277 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.00949178 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCAGTT GTCTCACCAT GCAGGATCGC GCTGTCGCCA TTAACCCGAC CCGTTCCTGC 
GCGCCAATCG GGGCAATGCT CGCCAATTAC GGCATTCACG GCGCCATTAC CATCAACCAC
GGCTCACAGG GATGCGCCAC CTACCCGCGA CATCAGATGG CGCGCCACTT CCGCGAACCG
GTCGAAGTCG CCACCACCTC GCTCACCGAA AAGACGACGG TCTATGGCGG CAAACAAAAC
CTGCTCGCGG CGCTCAAGAA TATCTGGGAA CGATTCCATC CGACGATGAT TATGGTCTGT
TCGACCTGTC TCTCCGAGAC GATCGGCGAC GACATTCCCG GAATCATCGA CGAGTTTCTG
GACAAGCGCC CGGAGGTCAC CATCCCGATC CTGTCGGTCA AAACCCCCTC GTACATCGGC
AACCACACGA CTGGCTTCGA CAATTTTCTC AAGGAGATCG CGCTCAATCT GCCGGATCGC
CGCAAGAAGA AAGGCGAGAC CAACGGCAAG ATCAACATTA TTCCCGGCTG GGTCAATCCC
GGCGACATCC GCGAACTAAA GCACATGCTG CGTGAAATGG GGCTGCACGG GTTGTGGATC
ACCGATTACT CGGAAACCCT CGACGGCGGC TACTACCATC CGCGCCCCCA CTTCCCGCGC
GGAGGCACGA CTGTTGAGGA ACTGCGCAAC TCCTCGAAGT CGCTGGCGAC TATCGCGCTC
CAGCGCCACA TCGGTGGTGA AGCAGCGCAC ATTTATGAGC GACGCTACAA CGTCCCCGCT
CACGTGCTGA CTATGCCCAT CGGCTTGAGG AACACCGATG CCTTTGTCAA CACACTGGTT
GAGATCACCG ACCACACGAT CCCCGAATCG CTAGGGGTCG AACGGGCGCG CCTGCTCGAT
GCACTGGTTG ATACGCATAT GTACACGACA GGATTGCGTG TTGCGCTCTA CGGCGATCCC
GATATGCTTG AGGGGCTAGT CGGGCTGATC GCCGAAATGG GCATGATCCC GGCACACATC
CTGACCGCCG CCGACAACCG CTCCTGGGGA GAACGGATGG TCGAACTGAC AGAGGAACTG
GAGGTCGAGA GCGAGATCAT TCTCAAGGGT GATCTCCACG AACTGCACAA GCGGATCAAG
CAACGACCGG TCGATCTGCT GATGGGACAC TCGAAAGGCA AATTTATCGC CGAAGCGGAA
AACATCCCGC TGGTGCGAGT TGGTTTCCCG GTCGAAGATC GCTTTGGCTA CCATCGTCGA
TCTATCGTTG GCTACAACGG CGCGACTGCA CTGGTCGATG AGATCACAAA TATGATCTTC
GAGCGCCGTG CAACGGCGAT TGTGAGCAAC ACCCTGCTCG AAACCGGCCT CGAAAGACCA
ACAGACATTC CGATCACGCT ACGCAATGGC GCCGCACACC ATCCGTAG
 
Protein sequence
MTSCLTMQDR AVAINPTRSC APIGAMLANY GIHGAITINH GSQGCATYPR HQMARHFREP 
VEVATTSLTE KTTVYGGKQN LLAALKNIWE RFHPTMIMVC STCLSETIGD DIPGIIDEFL
DKRPEVTIPI LSVKTPSYIG NHTTGFDNFL KEIALNLPDR RKKKGETNGK INIIPGWVNP
GDIRELKHML REMGLHGLWI TDYSETLDGG YYHPRPHFPR GGTTVEELRN SSKSLATIAL
QRHIGGEAAH IYERRYNVPA HVLTMPIGLR NTDAFVNTLV EITDHTIPES LGVERARLLD
ALVDTHMYTT GLRVALYGDP DMLEGLVGLI AEMGMIPAHI LTAADNRSWG ERMVELTEEL
EVESEIILKG DLHELHKRIK QRPVDLLMGH SKGKFIAEAE NIPLVRVGFP VEDRFGYHRR
SIVGYNGATA LVDEITNMIF ERRATAIVSN TLLETGLERP TDIPITLRNG AAHHP