Gene Sare_1452 details

Gene Information

Locus tag: Sare_1452
Symbol:
ID: 5704163
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 1676051
End bp: 1678126
Gene Length: 2076 bp
Protein Length: 691 aa
Translation table: 11
GC content: 70%
IMG OID: 641270961
Product: hypothetical protein
Protein accession: YP_001536342
Protein GI: 159037089
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0531] Amino acid transporters
TIGRFAM ID:
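
The start, end, gene length, and protein length fields above are mutually consistent, which a short sketch can confirm. The function names are illustrative only and do not belong to any database API. The sketch assumes 1-based inclusive coordinates and an unspliced CDS ending in a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1676051, 1678126)
print(length, protein_length(length))
```

With the values from this record, the computed lengths match the listed 2076 bp and 691 aa.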


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000394394
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0118097
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCCCGAC CCACCTCGCT GCTGAAGCGA CTGCTCGTCG GTCGACCGTT CCGGTCCGAC 
CGGCTTGCGC ACACCCTGTT GCCCAAGCGC ATCGCCCTGC CCGTCTTCGC CTCCGACGCG
CTTTCCAGCG TGGCGTACGC GCCCGACGAG ATCCTGCTGA TGCTCTCGAT CGCCGGGGCC
TCGGCGTTCG TGTTCTCGCC GTGGATCGCC CTGGCGGTGG TCGTGGTGAT GCTCGCCGTC
GTGGCGAGCT ACCGGCAGAA CGTGTACGCC TACCCGTCCG GCGGCGGCGA CTACGAGGTG
GCCACGGTCA ACCTGGGGCG GCGCGCCGGC GTCGGCGTGG CCAGCGCCCT CCTGGTCGAC
TACGTGCTGA CCGTCGCCGT GTCGGTCTCG TCCGGGGTGG CGAACCTGGG CGCGGTGATC
CCCTTCGTGG CCACCCACAA GGTGGCGGTG GCCGTCGGCG CGGTGGTGCT GCTCGCCGCC
GTCAACCTGC GTGGGATCAA GGAGTCCGGC ACCACCTTCG CCATCCCCAC CTACGGCTTC
ATGATCGTTA TCATCGGCAT GATCCTCACC GGGCTGGTCC GGGTGGTGCT GCTCGGCCAG
GACCTGCGCG CGCCCAGCGC GGATCTGGTC ATCGCCGCCG AGTGGAGCGA CACGACCGGG
TGGGCGATGG CGTTCCTGCT GCTTCGCAGC TTCTCCTCCG GCTGTGCCGC GCTCACCGGC
GTCGAGGCGA TCTCCAACGG GGTGCCCGCC TTCCGGGCGC CGAAGAGCCG AAACGCGGCG
ACCACCCTGT TCATGCTCGG CGTGGTCGCG GTGACCATGC TGGTCGGCAT CGTCTGGCTG
GCCCGGCTGA CCGGCCTGCA GTTCGTCGAG GACCCGGCCC GGCAGATCGT CGACGGCCCC
GACGGGTACG TGCAGAAGAC GGTCACCGCG CAGCTCGGCG AGACCATCTT CGGCTCCGGG
TCGCTACTGC TGTTCGTCGT CGTCGGCGTC ACCGCCCTGA TCCTCTTCCT GGCCGCGAAC
ACCGCGTTCA CCGGCTTCCC GGTGCTCGGG TCGATCCTCG CCCAGGACCG CTACCTGCCC
CGACAGCTGC ACACGCGGGG AGACCGGCTC GCGTTCTCCA ACGGCATCCT CTTCCTGGCC
GGCTTCGCGA TTCTGCTGAT CGTCGGTTTC CAGGCCGAGG TGACCAGGCT GATCCAGCTC
TACATCGTCG GCGTCTTCGT CTCGTTCACC CTGTCCCAGG CCGGCATGAT CCGGCACTGG
AACCGGTACC TCCAGACCGA GCGGGATCCG CAGGTACGCC GGCGGATGAT CCGCTCCCGG
GCGATCAACT CCTTCGGCAT GGCGATGACC GCCACCGTGC TGGTGATCGT GGTGGTCACC
AAGTTCCTGC TCGGCGCGTG GATCGCCATC GCCGCGATGG CCATGATCTA CCTGGTGATG
CTGGGAATCC GCCGGCACTA CGACCGGGTC GCCGTCGAAC TCACCCCGGA CGAGGGGCGG
CCGGTGAAAC CGGCCCGCAA CCACGCGGTC GTGCTGGTCA GCAAGGTGCA CCAGCCGACC
CTGCGGGCGC TCGCCTACGC GCAGGCCACC CGGCCGGACA GCCTGACCGC GGTGACGGTG
AACGTGGACG ACAAGGACAC CCGCCGGTTG CAGGCCGAGT GGGAGCGGCG GGACGTACCG
GTCGCCCTGA CCGTGATCGA CTCGCCGTAC CGGGAGATCA CCCGCCCGAT CCTGAACTAC
GTGGCCGGCG TCCGTCGGTC CTCACCCCGC GACGTGGTCA CCGTCTTCAT CCCCGAGTAC
GTGGTCGGCC ACTGGTGGGA GAACCTGCTG CACAACCAGA GCGCGCTGCG GCTTAAAGGG
CGGCTGTTGT TCGAGCCGGG GGTGATGGTC ACGAGCGTGC CCTGGCAGCT TGCCTCGACC
GCCGGCAAGG ACCTGGACCG GCTGGACGAG AACCTCAGCC GGGGACCGGC CCGCGGCCCC
CGGGTGACGC CCGGTGGGCC GCCGAGCACC CCGCCAGCGG TCTCCGCCCC GCCGACCAAG
ACTGGCCCGG ACCCGTCCGA CGGGAGTGAA CGGTGA
 
Protein sequence
MARPTSLLKR LLVGRPFRSD RLAHTLLPKR IALPVFASDA LSSVAYAPDE ILLMLSIAGA 
SAFVFSPWIA LAVVVVMLAV VASYRQNVYA YPSGGGDYEV ATVNLGRRAG VGVASALLVD
YVLTVAVSVS SGVANLGAVI PFVATHKVAV AVGAVVLLAA VNLRGIKESG TTFAIPTYGF
MIVIIGMILT GLVRVVLLGQ DLRAPSADLV IAAEWSDTTG WAMAFLLLRS FSSGCAALTG
VEAISNGVPA FRAPKSRNAA TTLFMLGVVA VTMLVGIVWL ARLTGLQFVE DPARQIVDGP
DGYVQKTVTA QLGETIFGSG SLLLFVVVGV TALILFLAAN TAFTGFPVLG SILAQDRYLP
RQLHTRGDRL AFSNGILFLA GFAILLIVGF QAEVTRLIQL YIVGVFVSFT LSQAGMIRHW
NRYLQTERDP QVRRRMIRSR AINSFGMAMT ATVLVIVVVT KFLLGAWIAI AAMAMIYLVM
LGIRRHYDRV AVELTPDEGR PVKPARNHAV VLVSKVHQPT LRALAYAQAT RPDSLTAVTV
NVDDKDTRRL QAEWERRDVP VALTVIDSPY REITRPILNY VAGVRRSSPR DVVTVFIPEY
VVGHWWENLL HNQSALRLKG RLLFEPGVMV TSVPWQLAST AGKDLDRLDE NLSRGPARGP
RVTPGGPPST PPAVSAPPTK TGPDPSDGSE R
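
The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation. The helper names `join_blocks` and `gc_fraction` are hypothetical, and the demo string is only the first two blocks of the gene sequence, not the full record.

```python
def join_blocks(display_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(display_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence, used only as a short demo.
demo = join_blocks("GTGGCCCGAC CCACCTCGCT")
print(len(demo), gc_fraction(demo))
```

Applying the same two functions to the full gene sequence should give a length of 2076 and a GC fraction close to the reported 70%, although the rounding used by the source database is not stated in this record.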