Gene Sare_4443 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4443 
Symbol 
ID5705921 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5019478 
End bp5021364 
Gene Length1887 bp 
Protein Length628 aa 
Translation table11 
GC content76% 
IMG OID641273859 
Producthypothetical protein 
Protein accessionYP_001539208 
Protein GI159039955 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0183712 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGGCGA CCGACCTGAC CGGCCGGGCA TCGGCCATCC CACCCGACTC GTCACCAGGG 
CCACGGCGCA CAGACGATCC CGGCGACGAT CGGTCCACCG GCAGCCCCAG CAGTGGGGCG
GGCATCGTCC TGCCACCCCA GTGGCGGGTG CGCCACCGGT TGCCGGACGG TCGGATTCGG
GCCGGCCAGG TGGTCGTCCT CCAACTGGCG GTCGTGATCG CGGTCGTGGC CCTGGGGCAG
GCAATGCCCG CGATGCTGAT CGGCCTGGCC GCGGCCGCGC TCCTGGCAAC CCTGGCCGGG
GCACGGGTGC GGGACCGCTG GCTGATCGAG TGGATCGGCA CCGCTGTCGC CTACGCGTTC
CGGCGGCGGA CTCTGCCCGC GGACGTCGGC TCAGCCGCCC TGCTGGACCG GCTCGACCCA
GGTGCCGTGC TCCGACCCGC CGAGCTGGCC GATGCGCCGG CGGCCGTGCT GGACGACGCC
ACCGGCCTGG TGGCCCTACT GGAGATCACC GACCCGTCCG AACTGATCGG CGACGAGGCA
CGCTCGCTGC CGCCGCCGGC CACCCTGCTC TCGGCCGGCA CGCCGCACGG ACCGCCGATC
CGGGTGCAGC TCCTGCTGAG CAGGACGACC GCCCCCGCGG TGGCCCTCGG CGGTGCGGTG
ATCGCCACGT CGTACCGGCA GCTCACCGAG GGGCGACTGG GTGGCCACGA ACGGGCGATA
CTCGCCGTCC GGGTGCTTCG CGTCGACGGC GCCTCCCCGG CCGAGCTGCG ACATGCCCTG
GCCGGCACGA TGCGCCGGAT CGTCCGCCGA CTCCGGCCAC TGTCAGGCCG TCCGCTGGGG
AAGCCCGCGG CGCTGGCGGC CCTGGCCGAG TTGGCACACC ACGAGGCCGG GCCGGTGCGG
GAGACCTGGT CGGCCCTGCA CGGGCGCCAC CTGCTCCAGG CCAGCTTCCA CCTCGACCGG
TGGCCCGACC CCCGCTCCGC GGGAGGGCGT CAGCTGGTGT CCGGGCTCCT CGCCGTGCCG
GCCACCGCCG TCACGGTCGC CCTCGCCGCC GGCCCGCGGC CCGGTACCGC CCGCTCCGAA
CTGGCCGTGC GATTGGCCGC CACCGCCCCG GCTGAACTCG CGGCAGCCAC TCGGACGGTG
CGGCGCACGG TCGACGAGGC GGGTGGCGAG GTGCGGCGGC TCGACGGGGA CCAGCTCGGT
GGACTCGCCG CCACCGTCCC GCTCGCCCTG CCCGGTCGGG GCCGGCCCGG CCCCGCGGCG
CCGGAGCTGA CCATCGGCGA CGCCGGCCTG TTGGTGGGCG TCAATCGGCA CGGGTCCGCC
GTGACGGTTC GGCTGTTCCG TCCGGAGGGC ACCCGAGTCG TGCTCGTCGG TGGACTCCGC
GCGGGCCAGA CACTTGTCCT GCGGGCGATG GCACTCGGCG CCCGGGTAGC GGTGCGGACC
ACCCGCCCCA CCTCCTGGGA ACCGTTCCTG CGTGCCGTCG GCGCTCCAGG TGGGGAGGTC
CCACGGCTAC TGCCCCCGGG CGGGCCGACC AACGACGCTT CGGGGTCACC GCTGCACCCC
CTGCTGCTGG TCGTCGACAC GGGTCCGGTC GGCGCTGCCC CTGAGCAACC AGGCCCGCCA
TGGCAGGCCA CGCTGCTCCT CCGCGACGAG CTGACCTCCG CCGACGTCGC GACACTCGGC
CACGCCGACC TGGCCGTGTT CCAGCCGCTC GACTCCACCG AGGCGGCGCT GGCCGGCAAC
GCGCTGGGGC TCGGCCCGTC CGCCGAATGG CTCACCCGCA TGCGACAGGA CATGGTTGCC
GTGGTCAACC GCCGGGCCCT GCGCTGGGCG CGCCTAACGC CGTCGCCGCT CGAGATGCGG
CTGGTAGGCC CACCCGATCG CCGCTGA
 
Protein sequence
MPATDLTGRA SAIPPDSSPG PRRTDDPGDD RSTGSPSSGA GIVLPPQWRV RHRLPDGRIR 
AGQVVVLQLA VVIAVVALGQ AMPAMLIGLA AAALLATLAG ARVRDRWLIE WIGTAVAYAF
RRRTLPADVG SAALLDRLDP GAVLRPAELA DAPAAVLDDA TGLVALLEIT DPSELIGDEA
RSLPPPATLL SAGTPHGPPI RVQLLLSRTT APAVALGGAV IATSYRQLTE GRLGGHERAI
LAVRVLRVDG ASPAELRHAL AGTMRRIVRR LRPLSGRPLG KPAALAALAE LAHHEAGPVR
ETWSALHGRH LLQASFHLDR WPDPRSAGGR QLVSGLLAVP ATAVTVALAA GPRPGTARSE
LAVRLAATAP AELAAATRTV RRTVDEAGGE VRRLDGDQLG GLAATVPLAL PGRGRPGPAA
PELTIGDAGL LVGVNRHGSA VTVRLFRPEG TRVVLVGGLR AGQTLVLRAM ALGARVAVRT
TRPTSWEPFL RAVGAPGGEV PRLLPPGGPT NDASGSPLHP LLLVVDTGPV GAAPEQPGPP
WQATLLLRDE LTSADVATLG HADLAVFQPL DSTEAALAGN ALGLGPSAEW LTRMRQDMVA
VVNRRALRWA RLTPSPLEMR LVGPPDRR