Gene Sare_1793 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1793 
Symbol 
ID5708376 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2065073 
End bp2066308 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content66% 
IMG OID641271295 
Producthypothetical protein 
Protein accessionYP_001536670 
Protein GI159037417 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000632283 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGTCGCCG CGTCGACGGC CACGGTGCTC GTGGTCTCCG GATGCGGTGG CGGAGGTCCC 
CAATCGGGCG GGGACGAGAA GTTTACCGAC GGCGCGATCG TGCTGGGCGT GCTCAACGAC
CAGTCCGGCG TGTACTCCGA GCTGTCCGGC CGGAACTCGG TCACGGCCGT GGAACTGGCC
GTCGCCGATT TCACGGCGAA ATACGGCGAC CAGGCGGTCA CCACGGACAT CACCGTGCAA
ACCGCCGATC ACCAGAACAA GCCGGATGTG GCCAACAGCA AGGCCCAGGA GATGTACGAC
CGTCAGGGGG TCGACCTGAT CCTGGACGTG CCCACCTCGT CGGCGGCGCT GCGGGTGGCC
GACGTGGCGA AGGAGAAGCA GAAGCTCTAC TTCAACATCG GTGCGGCGAC CACCGACCTC
ACCGGCAAGA GCTGCAACAA ATACACCTTC CACTACGCGT ACGACACGTA CATGCTCGCC
AACGGCACCG GTCGGACCAC CACCGAGCAG ATCGGCCGGA ACTGGTACAT CCTCTATCCG
AACTACGCGT TCGGTCAGGA CATGGAGAAG AGCTTCTCCA CGGCCATCGC CGACGCCGGC
GGACGGGTCG TCGGCAAGGA CGGGGCACCG TTCCCGAACA CCAGCGGCGA CTTCTCCACC
TACCTGCTGA AGGCGCCGAC ACTGGACCCG AAGCCAGACG TGCTCGGCAC CATGCAGGCC
GGCGCGGAAC TGGTCAACGT GGTGAAGCAG TACAACGAGT TCAAGCTGCG CGACAAGGGT
GTCGGGCTGG CCGTCGGACT GATGTTCATC ACCGACATCC ACTCACTCAC CCCAGCCGCG
CTGGCCGGCA CCACCTACAC CGACGCCTGG TACTGGAACT TCGACGAACA GAACCGTGAG
TTCGCCGACC GGTTCCAGCA GGAGACGGGC ACCCGGCCGT CCTTCGCGCA CGCGGCGAAC
TACTCCGCCG CCACGCAGTA CCTGGAGGCG GTGCAGGCGG CCGGCACCGA CGATGCCGAC
ACCATCGTCG AGGAACTGGA GGGCAAGGAG ATCAACGACG TCTTCCTGCG CAACGGCAAG
ATCCGCGCGG AGGACCACCG GGTGGTCCAC GACGCCTACC TGGCCCAGGT GAAGCCGCAG
TCCGAGGTCA CCGAGCCGTG GGACTACGTG CGGATCCTCG AGACCATCCC GGCCGGGGAG
GCGTTCCGGG CCCCGTCCCC GGACTGCAGC CTGTGA
 
Protein sequence
MVAASTATVL VVSGCGGGGP QSGGDEKFTD GAIVLGVLND QSGVYSELSG RNSVTAVELA 
VADFTAKYGD QAVTTDITVQ TADHQNKPDV ANSKAQEMYD RQGVDLILDV PTSSAALRVA
DVAKEKQKLY FNIGAATTDL TGKSCNKYTF HYAYDTYMLA NGTGRTTTEQ IGRNWYILYP
NYAFGQDMEK SFSTAIADAG GRVVGKDGAP FPNTSGDFST YLLKAPTLDP KPDVLGTMQA
GAELVNVVKQ YNEFKLRDKG VGLAVGLMFI TDIHSLTPAA LAGTTYTDAW YWNFDEQNRE
FADRFQQETG TRPSFAHAAN YSAATQYLEA VQAAGTDDAD TIVEELEGKE INDVFLRNGK
IRAEDHRVVH DAYLAQVKPQ SEVTEPWDYV RILETIPAGE AFRAPSPDCS L