Gene Sare_3224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3224 
Symbol 
ID5705447 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3714919 
End bp3715890 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content64% 
IMG OID641272655 
Productsubstrate-binding region of ABC-type glycine betaine transport system 
Protein accessionYP_001538022 
Protein GI159038769 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGATAC CAGCCTGGAT CAGGGCAACG TCCCTCGTTG GCTCGGCCGT ACTGGCACTG 
ACGGCGTGCA CGGCCGACAT CCAGCCCCGG GTCGCGGCAC CCGGCCCGCC CGCGGCGATC
GAGTGCGGCA CCCTGACCCT GGCGGTCAAC CCGTGGGTCG GGTACGAGGC GAACGTCGCC
GTCATCAGCT ACCTGGCCAA GAACCAGCTC AACTGCACCG TGGTCGAGAA GGACCTCAGC
GAGGAGGAGT CCTGGAAGCT GCTCGCAGCC GGCGAGATCG ACGCGATCCT GGAGAACTGG
GGCCACGACG ACCTGAAAAA GCAGTACATC GACGACGAGC GGGTTGCCGT GGAACACGGT
CTCACCGGTA ACAAGGGCAT CATCGGCTGG TATGTCCCGC CATGGCTGGC CGAGAGATAC
CCGGGCATCA CCGACTGGCG GAAGCTGAAC GACTACACCT TTCTGTTCCG CACTCCCCGC
TCCGGTGGTA GGGGGGAACT GCTCGGCGGC GACCCCACCT ACGTCACCAA CGACAAGGCG
CTGATCCGCA ACCTGAAGCT GAACTACACG GTCACCTTCA CCGGAAGTGA GGACAAGCTG
ATCGAGGCGT TCCGCACGGC GGAGGAGGAG CGTCGGGCCG TCATCGGATA CTTCTACGCC
CCCCAGTGGT TTCTCTCCGA GGTCGATCTG GTGCACATCA GGCTCCCTGA GTACACACCC
GGCTGTGACG CGGATCCGGC GAAGGTGGCC TGTGACTACC AGCCGTATGA TCTCGACAAG
ATTGCCAACC GGGAGTTCGC CGAATCCGGT AGTCCGGCCG CGGATTTGAT CAAGAACTTC
CAGTGGACCA ACGCCGATCA GAACACGGTG GCCCGTTACA TCCGGCAGGA CAAGATGTCC
CGCGACGAGG CGGCCAAGAA GTGGCTGGAC GCGAACCCCG ACGTCTGGCG GTCCTGGCTG
CCTGCCACCT GA
 
Protein sequence
MRIPAWIRAT SLVGSAVLAL TACTADIQPR VAAPGPPAAI ECGTLTLAVN PWVGYEANVA 
VISYLAKNQL NCTVVEKDLS EEESWKLLAA GEIDAILENW GHDDLKKQYI DDERVAVEHG
LTGNKGIIGW YVPPWLAERY PGITDWRKLN DYTFLFRTPR SGGRGELLGG DPTYVTNDKA
LIRNLKLNYT VTFTGSEDKL IEAFRTAEEE RRAVIGYFYA PQWFLSEVDL VHIRLPEYTP
GCDADPAKVA CDYQPYDLDK IANREFAESG SPAADLIKNF QWTNADQNTV ARYIRQDKMS
RDEAAKKWLD ANPDVWRSWL PAT