Gene Sare_1069 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1069 
Symbol 
ID5704337 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1197133 
End bp1198623 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content67% 
IMG OID641270585 
ProductFAD dependent oxidoreductase 
Protein accessionYP_001535969 
Protein GI159036716 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1233] Phytoene dehydrogenase and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.302171 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00376489 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAACGACG ACTACGACGC CATCGTCATC GGTGCGGGCA ACGCCGGGCT CAGCAGCGCG 
GCCACGCTCC AACGCGGTGG CAGGCGCACG CTCCTGGTTG AGCGCCACAA CGTTCCCGGT
GGTGCGGCGA CCTCGTTCGT GCGGGGGCGG TTCGAGTTCG AGGTGTCGCT GCACCAGTTG
GGTGGCATGG GGAACGGCCC ACTCCGGCAG GTTCTCGACC AGCTGGACGT CACGCGCAGG
TTGACGTTCG TGGAGGAGCG TGACCTCTAC CGCACCGTGG TCCCCGGGGT CCTCGACATC
ACCCTGCCCG CCGAGTGGAA GGGTGTCGCC GACGCGATCG AGGACAGCTT TCCGGGCAAC
CGCGCGCGGG TCGAGTCCTT CCTCGAGTTG TGCCACGAGG TCGGAAACTG GCAGCTGGTG
GCTCGGGCGA GCCTGCATCA GCCCCAGGAA CAGGCCGCGT GGCTGCGCGG GCTGTCCCGG
GTGCGCCGCA ACGGTCTGCG CCCGGCGAAG GAGGTGCTCG ACGAATACTT CGACGACGAC
CGGATCAAGC ATGTTCTGGC GTCCTACTGG AGCTACAACG GGCAGCCTCC CTCCACACTG
CCGTTCATGG ACCTGGCGAG AATTCTGACG CTCTACCTCG AATACAAGCC CTACCACCTG
CGCGGTGGCA GCCAGGCCAT GTCCTCGGCG ATGCTGGACT CCTTCCTGGA AGCGGGCGGC
GATGTCCGCT TCAACTCGGA CGTCGCGGAA ATCCGCACGC GCCAGGGCAC CGTGGTCGGG
ATCCGCCTGG CCAACGGCGA CGAGTACGAC GCTCGGATGG TGGTATCGAA CGCCTCCGCC
ATCACCACCT ACACCCGCAT GCTCGACCCC ACCGTCGTCC CCGATTCCGT CCTGCGCGAC
CTCCGCGGCA GAAAGCTGGG AATCTCGGGC ACGATCATCT ACCTCGGCCT GGACGCCACG
GCACACGAAC TGGGCTTCAC CGCGGGCACC AACATGATCA CCAGCGAGCT CGCGGAGAAG
ACCGTCCGGG ACAGCATGTT CTCCCTCGGT CCGACGCCTT ACGTGGTCGC GAGCTGTTAC
GACGTCGACC CGATCGGGTT CGCGCCGCCC CGGGCATCCC ACGTGGCGAT CTTCTCCGTC
CAGTACGGCA GGGTGTGGGA CACGCTCGCA CCGGAGGACT ACGCCCGAGC CAAGTACGCG
TACGCCCGAT CCCAGCTCGA CTTGGTCGAG GTGATAAGCC CTGGGCTGCG GGACGTCATC
GAGGAGGCCG AGGTGGCCAC CCCGCACACC CTCCGGCGCT ACCTCGGCCA CCCAGGCGGC
GCCATCTACG GCTTCGACCA GGACATCACC GACAGCTGGC TCTTCCGCGA CACCGATCTC
AAACCCAATG TGCCCGGCCT GTTCCTGCTC AGCGCGTGGA CCACCGCGGG CGGGTACAAC
CCCACGATCG TGACCGCGGC ACGCTTCTCC CAGCGGCTCC TGCAGCTCTA G
 
Protein sequence
MNDDYDAIVI GAGNAGLSSA ATLQRGGRRT LLVERHNVPG GAATSFVRGR FEFEVSLHQL 
GGMGNGPLRQ VLDQLDVTRR LTFVEERDLY RTVVPGVLDI TLPAEWKGVA DAIEDSFPGN
RARVESFLEL CHEVGNWQLV ARASLHQPQE QAAWLRGLSR VRRNGLRPAK EVLDEYFDDD
RIKHVLASYW SYNGQPPSTL PFMDLARILT LYLEYKPYHL RGGSQAMSSA MLDSFLEAGG
DVRFNSDVAE IRTRQGTVVG IRLANGDEYD ARMVVSNASA ITTYTRMLDP TVVPDSVLRD
LRGRKLGISG TIIYLGLDAT AHELGFTAGT NMITSELAEK TVRDSMFSLG PTPYVVASCY
DVDPIGFAPP RASHVAIFSV QYGRVWDTLA PEDYARAKYA YARSQLDLVE VISPGLRDVI
EEAEVATPHT LRRYLGHPGG AIYGFDQDIT DSWLFRDTDL KPNVPGLFLL SAWTTAGGYN
PTIVTAARFS QRLLQL