Gene Rcas_3799 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3799 
Symbol 
ID5541301 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4970658 
End bp4971533 
Gene Length876 bp 
Protein Length291 aa 
Translation table11 
GC content64% 
IMG OID640895909 
ProductNifU domain-containing protein 
Protein accessionYP_001433856 
Protein GI156743727 
COG category[O] Posttranslational modification, protein turnover, chaperones
[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG0694] Thioredoxin-like proteins and domains
[COG2146] Ferredoxin subunits of nitrite reductase and ring-hydroxylating dioxygenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACAC CACTCAACGG CCACGATACT GCCATCGACG CACTGGAGCG GCTGGCGCTG 
CGTATCGAGC AGGCAATAGC GGCGGTGGGC AAACTGGATG ATCAGGCGCG TGAGTGTGCG
CTCGAACTGC AACGCGCCAT TGAAACCTTC CACAAAGAGG CGCTCACCCG CATCGTGCGC
CGCCTGAAGG ACGATCCGCG CGGCAAAGAA CTCCTCTTCG ATCTGGTCGA TGATCCGCTC
ATCTATACGC TCTTCGCGCG CCACGAGATC GTGCGCCCGA ACATCGTCGC GCGCGTCTCG
CGTGTGCTCG ACGCAGCGCG CCCCTACATC CGCTCGCACG GCGGCGATGT CGAACTTGTC
GAAGTGCGCG AGAATGTGGT CTACGTCCGG CTCCACGGCA GTTGCAACGG ATGTTCGCTC
TCGGCGGTGA CGCTGCGCAA CGAAATCGAA GCCGCGCTGC GCGCCAATGT GCCCGAAATT
GTCGGTGTGC AGGTGGTCGC CGCCGGCGCG GTCCCGGCGC TGATCATGCC TGAAAGCATC
GGCGTGCGTG ACCCGCTCGC CGGATGGGTC GCCGGTCCGC CGGCGAGCGA TGTGCCTCCC
GGCGCAATGC GCCGTCTGGA AACCGATCAT GCGGACATTC TGATTATCAA TCTCGACGGA
CGGTTGAGCG CCTTTCGCAA TGCCTGTGCG CATCAAGGGC TGCCGCTCAA CGGCGGGTTG
CTCGATCCCG AAAGTGGCAC ACTTACCTGC CCATGGCACG GTTTCTGCTA CGACGCCAGC
AGCGGCGAAT GCCTGACTGT GCCGCAAGCG CAATTGGAGC CGTTCCCGCT GCGCGTCGAA
CAGGGACGCA TCTGGGTGCG CCCGACGCAG GGATAG
 
Protein sequence
MTTPLNGHDT AIDALERLAL RIEQAIAAVG KLDDQARECA LELQRAIETF HKEALTRIVR 
RLKDDPRGKE LLFDLVDDPL IYTLFARHEI VRPNIVARVS RVLDAARPYI RSHGGDVELV
EVRENVVYVR LHGSCNGCSL SAVTLRNEIE AALRANVPEI VGVQVVAAGA VPALIMPESI
GVRDPLAGWV AGPPASDVPP GAMRRLETDH ADILIINLDG RLSAFRNACA HQGLPLNGGL
LDPESGTLTC PWHGFCYDAS SGECLTVPQA QLEPFPLRVE QGRIWVRPTQ G