Gene Sare_4224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4224 
Symbol 
ID5704395 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4794564 
End bp4796258 
Gene Length1695 bp 
Protein Length564 aa 
Translation table11 
GC content72% 
IMG OID641273643 
ProductFAD dependent oxidoreductase 
Protein accessionYP_001538996 
Protein GI159039743 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.174352 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGTACG ACGTGGTCGT CATCGGGTCC GGATTCGGCG GTGGCGTCGC CGCGCTACGG 
CTCGCCGAGA AGGGCTACCG GGTCGGGGTG ATAGAGGCGG GCCGACGCTT CGCCGACGAC
GAGTTCCCAC AGACCTCATG GCGGCTGCGC CGCTTCGTCT GGGCACCGCG GCTGGGTTGT
TACGGCCTGC AGCGGATCAC GTTGCTGCGG GCGGGCAACC GGCGGGCCGG CGGCGGTGTG
TTGGTGCTCT CCGGCGCCGG GGTGGGCGGA GGCTCACTGG TCTACGCAAA CACCTTGTAC
GAGCCGTTGG ACGCCTTCTT CGGGGATCCG CAGTGGCGGG ACATCACCGA CTGGCGGGAC
GAGTTGACCC GCCATTTCGA TCAGGCGAAG CGGATGCTCG GCGTCACCAC GTACCCGGTC
ACGACCGGAG CGGATCGGGC GATGCGGGCG GTGGCGGACC GGATGGGGGT CGGGCACACG
TACCGGGCCA CCCCGGTCGG GGTGCACATC GGTCGGCCCG GGCAGCGGGT GCCCGACCCG
TACTTCGGTG GGGCGGGGCC GGAGCGCACC GGGTGTACGC ACTGCGGCGC CTGCATGACG
GGGTGTCGGC ACGGCGCGAA GAACACGTTG GTCAAGAACT ACCTGTGGCT GGCCGAGCGG
CTCGGGGCGC GGGTGCACCC GTTGACGACC GTGACCGCCG TCCGGCCGGT CGAGGGGGGC
GGGTACGCGG TGCACACCGC ACGTACCGGC GCCTGGCTGC GTGGGCGTAC CCAGGTGATT
CACGCTGACC AGGTGGTCTT CGCGGCCGGT GCGTTGGGTA CGCAGCGCCT GCTGCACGGG
ATGAAGGCCA TCGGGGCGCT GCCCCGGCTC TCGCCCCGCC TCGGTGAGTT GACCCGAACC
AACTCGGAGG CGATCGTCGG TGCGTCGGTG CCCCGGCGGC GGGCGCGAGC GGACGGGACC
GACTTCACCG AGGGGGTGGC GATCACGAGT TCGTTCCACC CTGACTCCCA GACGCACATC
GAGCCGGTCC GTTACGGCCG GGGCTCGAAT GCGATGGCGC TGCTCCAGTC CCTGCTGGTC
GACGGTGGTC CTCGGCGGGT ACGCCGCTGG CTGGGCACCC ACGTGCGGCG GCCCCGCGAC
GTGGTGCGGA TGCTGTCGGT CCGTAACTGG TCCGAGCGCA CCGTGATCGC CCTGGTCATG
CAGTCGGTGG ACAATTCGCT GACCACTCGC CTCCGGCGAG GGCTCCGCGG CCGTCGGCTT
GTCTCCGATG CCGGTCACGG AGCGCCGAAC CCGACCTGGA TCCCGGCCGG CAACCGGGCG
GCTCGGCTCC TCGCCGAGGA GATCAACGGG GTGGCGGGCG GTTCGCTCAC CGAACCGTTC
AACATCCCGG TGACCGCGCA CATCCTGGGT GGCGCGGTGA TCGGTGCCAC CCCGGACGAC
GGCGTGGTCG ACCCGTGGCA CCGGGTGTAC GGGCACCCCG GGCTGCACGT CGTGGACGGT
GCCGCGGTCT CGGCCAATCT CGGGGTGAAC CCGAGCCTGA CCATCACCGC CCAGGCCGAG
CGGGCCATGT CCTTCTGGCC GAACAAGGGT GACGAGGATC ACCGCCCTCC GCTCGGCTCG
TCGTACGTCC GGCTGGCCCC CGTACCTCCG GACAATCCGG CGGTGCCGGC CGACGCACCC
GGCGCGCTGC GGTAG
 
Protein sequence
MRYDVVVIGS GFGGGVAALR LAEKGYRVGV IEAGRRFADD EFPQTSWRLR RFVWAPRLGC 
YGLQRITLLR AGNRRAGGGV LVLSGAGVGG GSLVYANTLY EPLDAFFGDP QWRDITDWRD
ELTRHFDQAK RMLGVTTYPV TTGADRAMRA VADRMGVGHT YRATPVGVHI GRPGQRVPDP
YFGGAGPERT GCTHCGACMT GCRHGAKNTL VKNYLWLAER LGARVHPLTT VTAVRPVEGG
GYAVHTARTG AWLRGRTQVI HADQVVFAAG ALGTQRLLHG MKAIGALPRL SPRLGELTRT
NSEAIVGASV PRRRARADGT DFTEGVAITS SFHPDSQTHI EPVRYGRGSN AMALLQSLLV
DGGPRRVRRW LGTHVRRPRD VVRMLSVRNW SERTVIALVM QSVDNSLTTR LRRGLRGRRL
VSDAGHGAPN PTWIPAGNRA ARLLAEEING VAGGSLTEPF NIPVTAHILG GAVIGATPDD
GVVDPWHRVY GHPGLHVVDG AAVSANLGVN PSLTITAQAE RAMSFWPNKG DEDHRPPLGS
SYVRLAPVPP DNPAVPADAP GALR