Gene Sare_1795 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1795 
Symbol 
ID5708378 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2067236 
End bp2068252 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content71% 
IMG OID641271297 
Productinner-membrane translocator 
Protein accessionYP_001536672 
Protein GI159037419 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4177] ABC-type branched-chain amino acid transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.121113 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000132557 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGAGCA TCGTGGACAG CAGGTCCGAG CCAGCCCGGC GGGGGACACC CAGCGGGCTG 
ACCACCGTCA CGCGGGCCCC CGGTTGGCTT CGGTACGCGC TGCTCGTCGT CGGCCTGGCG
GTGGCGCTCT GGCTACCGAA CGGGCTGTAT CCGGCGGTGG CGGTGGACAT CCTCTGCTGG
GCGCTGTTCG CCGTCGCGGT GGACCTGTTG CTCGGCTTCA CCGGGTTGCT GTCCTTCGGA
CACGCGGCCT TCTGGGGTGT GTCGGCGTAC GTGACCGGTC TGGTGGCGAT CCACCTTGGC
CTGCCGTTCC CGGCGGCGGT GCTGGCCGGG GCGTTCGCCG CGGCGGTGCT CGCAGTGCCG
ATCGGGTACC TGGCGGTGCG GCGAACCGGG ATCTACTTCG CCATGGTCAC CCTGGCATTC
GCGCAGCTGG TCTACTACGT CGCCAACGAG TGGCGGTCGG TGACCCAGGG CGAGAACGGC
CTCCAGGGCG TGCCGCGGGA GTTGTTCGGG CTCGACCTGA CCGACGACTA CTACTTCTAC
TACGCGATCC TGCCGATCGT GCTGCTCGGG CTGGCCGGGG CGTGGCGTAT CGTCCACTCG
CCGTTCGGTC GGGTGCTGGT GGGCATCCGG GACAACCCGG CGCGGGCCCG GGCGCTCGGC
TACCCGGTGC ACCGGTACAA GCTGACCATC TTCGTGCTCT CCGGCTTCGT CGCCGGCCTC
GGCGGCGGCC TGTTCGCCGT CAGTCACCGG TTCGTGTCGC TGGAGGTGCT GCACTGGACC
ACCTCCGGCA AGGCGGTGAT CGTGGTGGTG CTGGGCGGTA TCGGCACCCT CTGGGGCGGC
GTGCTCGGCG CCGGCATCGT GGTCCGCCTG GAGGACTGGC TGTCGTTCTC CGGCTTCGAG
GCGATCGGGC TGGTCACCGG CGGCCTCTTC GTGCTGGTGG TGCTGCTGTT CCGGCGGGGC
ATCTGGGGCA CCGCGTCCGC ACTGGCCCGG CGCCGGTGGG CGGCCCGCCG CGACTGA
 
Protein sequence
MTSIVDSRSE PARRGTPSGL TTVTRAPGWL RYALLVVGLA VALWLPNGLY PAVAVDILCW 
ALFAVAVDLL LGFTGLLSFG HAAFWGVSAY VTGLVAIHLG LPFPAAVLAG AFAAAVLAVP
IGYLAVRRTG IYFAMVTLAF AQLVYYVANE WRSVTQGENG LQGVPRELFG LDLTDDYYFY
YAILPIVLLG LAGAWRIVHS PFGRVLVGIR DNPARARALG YPVHRYKLTI FVLSGFVAGL
GGGLFAVSHR FVSLEVLHWT TSGKAVIVVV LGGIGTLWGG VLGAGIVVRL EDWLSFSGFE
AIGLVTGGLF VLVVLLFRRG IWGTASALAR RRWAARRD