Gene Sare_3377 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3377 
Symbol 
ID5703411 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3900968 
End bp3902110 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content67% 
IMG OID641272803 
Productextracellular ligand-binding receptor 
Protein accessionYP_001538170 
Protein GI159038917 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0317113 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACGTAC GGGCGCTCGG CGCCATCGGG TTGTCGGCAG CCCTGTTTGC CGCGGCGGGC 
TGCCAGGCGT CGGAGGACGA CAGTGCTGGC GGCAACGGCG ACTGCGGTGG CAAGATCGCC
ATTTTCGGTG CCTTCAGCGG CCCGAACTCG GGTCTGGTGA TCCCGTCGCT CAACGGCGCC
AAGCTCGCTG CGAAGCAGCA CAACGAGGCG AACCCGGACT GCGAGGTCAC CCTGCAGGAG
TTCGACACCC AGGGCGACCC GACCCAGGCC ACGCCAGTCG CGAACCAGGT CGTCAACGAC
GAGAGCTTCC TGGGCGTCAT CGGTGGTGCG TTCTCTGGTG AGAGCAAGGC GACGATGGAC
GTCTACGAGG CTGCCGGCAT GGTGATGGTG AGCCCGTCGG CGACCGCGAT CGAGCTGACC
GCCGGCGGCA ACAAGGTGTT CCACCGGGTG GTCGGTAACG ACGGCACCCA GGGTGCTGCC
GCCGCCGTCT ACTTCCGGGA TGTCGTGCAG GCCAAGAAGG TCTTCGTGAT CAATGACGGC
ACCACCTACG GTGCCGGTAT CGCCGAGGAG CTGACCAAGG CCCTCGGTGA CCTGGCCGCT
GGTACCGACC AGGTGCAGGA GAAGCAGGTC AACTTCGCGG CCACCATCTC GAAGATCAAG
GCCGCCGCGC CGGACGCGGT CGCCTACGGC GGCTACACGA ACGAGGCGGC TCCGCTGCTG
AAGCAGATGC GGGAGGCCGG TCTGACCACG ACCTTCCTCG GCTTCGACGG CCTGTACGAC
CCGGCCTTCC CGGAGGGCGC CGGCACCAGT GCCAATGGTG CGATCGTGAC CTGCCCGTGC
CTGCCTGCGG ACAAGGCCGG CGGCTCCTTC ACGGCCGACT TCGAGAAGGA GTACGGCGGA
CCGCCCGGAT CCTACGGTGC CGAGGGCTTC GACGCCGCGA ACGTGCTGAT CGAGGGCCTG
GCCGAGGGCA ACACCACGCG CGAGAAGCTG CTGGCGTGGG TCGACGCCTA CGACAAGGAG
GGGGTCTCGA AGTACCTCAA GTTCGCCGAC ACCGGTGACG TGGACGAGTC CCGGGTGGTG
ATTTGGGCCT ACGAGGTCAA GGACGGCGCG ATCACGGCCC AGCAGGAGAT CAAGCTCAGC
TGA
 
Protein sequence
MYVRALGAIG LSAALFAAAG CQASEDDSAG GNGDCGGKIA IFGAFSGPNS GLVIPSLNGA 
KLAAKQHNEA NPDCEVTLQE FDTQGDPTQA TPVANQVVND ESFLGVIGGA FSGESKATMD
VYEAAGMVMV SPSATAIELT AGGNKVFHRV VGNDGTQGAA AAVYFRDVVQ AKKVFVINDG
TTYGAGIAEE LTKALGDLAA GTDQVQEKQV NFAATISKIK AAAPDAVAYG GYTNEAAPLL
KQMREAGLTT TFLGFDGLYD PAFPEGAGTS ANGAIVTCPC LPADKAGGSF TADFEKEYGG
PPGSYGAEGF DAANVLIEGL AEGNTTREKL LAWVDAYDKE GVSKYLKFAD TGDVDESRVV
IWAYEVKDGA ITAQQEIKLS