Gene Sare_0724 details


Gene Information

Locus tag: Sare_0724
Symbol:
ID: 5704529
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 806044
End bp: 807708
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 66%
IMG OID: 641270242
Product: Na+/solute symporter
Protein accession: YP_001535634
Protein GI: 159036381
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG0591] Na+/proline symporter
TIGRFAM ID:
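
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the terminal stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(806044, 807708)
print(length)                           # 1665
print(expected_protein_length(length))  # 554
```

Both results agree with the Gene Length and Protein Length fields of this record.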


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000999119
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTGGCGTG ACCACGTCAC CGAGATCAGC ATCTTCACCG CGCTCTTCCT GCTGGTGAGT 
GTCATGGGAT TCATCGCGTC CCGCTGGCGT GCGCCGAAGG ACATGGCCCA CCTCGACGAG
TGGGGGCTGG GCGGGCGCAA CTTCGGCGGC TGGATCACCT GGTTCCTGAT CGGCGGCGAC
CTGTACACCG CGTACACCTT CGTGGCGGTG CCGGCGCTGG TATTCGGAGC CGGTGCGATG
GGCTTCTTCG CGGTGCCGTA CACCGTCGTC ATCTACCCGA TGGTCTTCCT GGTGCTCTGT
CGACTCTGGT CGGTGTCACA CCGGCACGGC TTCGTCACCC CGGCGGACTT CGTCCGTAGC
CGGTTCGACT CGCCGGTCCT CGCGCTGCTC GTGGCGATTA CCGGCATCGT GGCCACGATG
CCGTACATCG CGCTGCAACT CGTCGGCATC GAGGCGGTAC TCAAGGCCAT GGGCATGACC
GGGGAGAGCA GCCTGGCCCG GCACCTGCCC ATCATCATCG CGTTCGCGAT CCTGGCGGCG
TACACGTACC AGTCCGGGCT GCGTGCGCCG GCGCTGATCG CCTTCGTCAA GGACACCCTG
ATCTACGTGG TGATCCTGGC GGCGGTGCTG TACCTGCCGT ACAAGCTGGG GGGCTGGGGG
GAGATCTTCG ACGCTGCCGA CGCGAAGTTC GAGGCATCAC CGAGCCCCAA CGACGGGATT
CTGCTCAACG CCAACAACCA GGTCCAGTAC GTCACCCTCG CCTTCGGTTC GGCGCTGGCG
CTCTTCCTGT ACCCGCACAG CATCACTGGT GTACTGGCCA GCCGGAACCG GGACGTCATC
AAGCGGAACA TGTCGGCGTT GCCCGCCTAC AGTGTGCTGC TCGGGCTCAT CGCGCTGCTC
GGCTTCATGG CGATCGCAGC CGACGTGCAG CCGCTACCCG GGGCGACCGA GGGCACGGTG
GACAACAACA CCATCGTTCC GGTGCTCTTC GACCAGCAGT TCCCGAGCTG GTTCACCGGA
GTCGCGTACG CGGCCATCGG CATCGGGGCG CTGGTGCCGG CGGCGATCAT GTCGATCGCG
GCAGCGAATC TGTTCACCCG CAACATCTAC AAGGAGTACC TGCGGCGGGA CGCCACCCCG
GCCCAGGAGG CGAATGTCTC CAAGCTCACC TCGTTGGTGG TGAAGATCGG CGCAGTAGCC
TGCATCGTCT TCCTCGACCC GCAGTTCTCC ATCGATCTCC AGCTGATCGG CGGCGTGATC
ATCCTGCAGA CGCTGCCGGC GGTGGCCCTC GGCCTCTACA CCCGCTGGTT CCACCGCGGC
GCGCTGATCG CCGGCTGGGC AGTTGGCATG TCCCTGGGGA TGTGGATGCT CTACCAGGTG
TCCAGCCCGA CCCGGAGTCA CTTCGGTGGC TCCGCGTTCC CGCTGGAGAG GTTTGGCCTC
GACACCACGA TGACCATCTA CGCCGGATTC GTGGCGGTGC TGGCCAACCT GGTGGTCGCG
ACGGTGCTGA CACTGGTGCT ACGGACCGCG AAGGTATCCG ACGACGGCGA CGGGACCAGC
CCGGACGACT ACTTCGCCGA CGAGGGGGAC CCGCGGGTGA CCCCAGGCGA CCGCCGTGGC
GCCGACCCGG CCGCGGAGCC GGTGGCTGGA ACGAACGTCG GGTAG
 
Protein sequence
MWRDHVTEIS IFTALFLLVS VMGFIASRWR APKDMAHLDE WGLGGRNFGG WITWFLIGGD 
LYTAYTFVAV PALVFGAGAM GFFAVPYTVV IYPMVFLVLC RLWSVSHRHG FVTPADFVRS
RFDSPVLALL VAITGIVATM PYIALQLVGI EAVLKAMGMT GESSLARHLP IIIAFAILAA
YTYQSGLRAP ALIAFVKDTL IYVVILAAVL YLPYKLGGWG EIFDAADAKF EASPSPNDGI
LLNANNQVQY VTLAFGSALA LFLYPHSITG VLASRNRDVI KRNMSALPAY SVLLGLIALL
GFMAIAADVQ PLPGATEGTV DNNTIVPVLF DQQFPSWFTG VAYAAIGIGA LVPAAIMSIA
AANLFTRNIY KEYLRRDATP AQEANVSKLT SLVVKIGAVA CIVFLDPQFS IDLQLIGGVI
ILQTLPAVAL GLYTRWFHRG ALIAGWAVGM SLGMWMLYQV SSPTRSHFGG SAFPLERFGL
DTTMTIYAGF VAVLANLVVA TVLTLVLRTA KVSDDGDGTS PDDYFADEGD PRVTPGDRRG
ADPAAEPVAG TNVG
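
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before lengths or base composition can be checked against the Gene Information fields. The following is a minimal sketch of that cleanup and of a GC-fraction calculation. The helper names are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence, as printed in the record.
first_blocks = clean_sequence("ATGTGGCGTG ACCACGTCAC")
print(len(first_blocks))          # 20
print(gc_fraction(first_blocks))  # 0.6
```

Applied to the full gene sequence, the cleaned length should match the Gene Length field and the GC fraction should be close to the reported GC content.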