Gene Sare_4349 details


Gene Information

Locus tag: Sare_4349
Symbol:
ID: 5708417
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4915825
End bp: 4916799
Gene Length: 975 bp
Protein Length: 324 aa
Translation table: 11
GC content: 70%
IMG OID: 641273771
Product: hypothetical protein
Protein accession: YP_001539121
Protein GI: 159039868
COG category: [C] Energy production and conversion
COG ID: [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases
TIGRFAM ID: [TIGR03620] probable F420-dependent oxidoreductase, MSMEG_4141 family
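
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes the coordinates are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 4915825
end_bp = 4916799

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends in one stop codon,
# which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (975 bp) and Protein Length (324 aa), and the gene length is a whole number of codons.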


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGAGAA CTTCTCCAGA TCGTACTGGT GACATCCACC ACCGGGAGAC CGACACGAAC 
GACAGTCACC GAGGGCTCGG TCGAGTCGGC ATCTGGACCA TGGCGTTCGA CTGGCAGCCA
GCCGGGCTCG TCCGCGACGC GACCGCCGAG TTGGAGGAAC TCGGCTACGG TGCGGTGTGG
TACGCCGAGG GCCTCGGCCG CGACGCGGTC AGCCAGGCAT GGCTCATCCT GGGCAACACC
CGGCGGCTGG TCGTCGGAGC GGGCGTCGCC AACATCGCCA CGCGGGAACC AATCGCGATG
GCCGCAGCCC ACCGTGCGCT GGACGACGCG TACGCGGGAC GGTTCGTGCT GGGACTCGGC
GGACATCGAA CCCACGACAC CCCGACCAAC GCTATCCCCG GGCGCTACGG ACGACCGGTA
CAGACGATGA CCGCCTACCT CGACGCCATG GACGCCGCCA CCACCGTGCT TCCCGAGCCA
ACACCTCCTC GCCGCCGGGT CCTCGCCGCA CTCGGCCCCA GAATGACCGA ACTCGCCGCA
CAACGCACCG AGGGCGCCCT GCCCTACTTC GCACCCGTCG AACACACCCG CCGCGCCCGG
GAGGCCATGG GACCTGGTCC ACTGCTCGCA GTGGAACTCG CGGTCGCCCT CGCCGACGAA
CCCGATCGCG GGCGCCAGCT GGCCCGCGAC CATGTCGCCT ACTACACCTC CACCGCCCCG
CACCAGGCCG CCAATCTGCG TCGCCTGGGC TTCACCGAAC AGGACATGCG GGGCCTGAGT
AGCACCCTGG TCGACGCCGT GGTCGCCCAC GGCGACCTCG ACACGGTACG CACCCGCGTG
CGGGAGCACC TGGACGCGGG CGCAAACCAC GTCTGCATCC AGGTGCTCAC CGCGGATCCG
GCCACGCTGC CCATGGACGA GTGGCGGGAG CTGGCGTTCC TCACCACCGA GGCGACGACA
TCGAGGGTCG GTTGA
 
Protein sequence
MRRTSPDRTG DIHHRETDTN DSHRGLGRVG IWTMAFDWQP AGLVRDATAE LEELGYGAVW 
YAEGLGRDAV SQAWLILGNT RRLVVGAGVA NIATREPIAM AAAHRALDDA YAGRFVLGLG
GHRTHDTPTN AIPGRYGRPV QTMTAYLDAM DAATTVLPEP TPPRRRVLAA LGPRMTELAA
QRTEGALPYF APVEHTRRAR EAMGPGPLLA VELAVALADE PDRGRQLARD HVAYYTSTAP
HQAANLRRLG FTEQDMRGLS STLVDAVVAH GDLDTVRTRV REHLDAGANH VCIQVLTADP
ATLPMDEWRE LAFLTTEATT SRVG