Gene Sare_3997 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3997 
Symbol 
ID5704883 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4547872 
End bp4549113 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content69% 
IMG OID641273422 
Producthypothetical protein 
Protein accessionYP_001538778 
Protein GI159039525 
COG category[R] General function prediction only 
COG ID[COG0628] Predicted permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00194517 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0036434 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGACACAGG GCGAGCCGGA GCTGCCGGAG CCGACCGGCC GCATGAACCA GCCGCCGGCC 
GGTCAGCCCA CCGGACCAGG CCCGACCAAC GCACCGCGCC CCTCCTTCCC CGGCGACACC
GCGGGGGAAG GGGCCGAATC CGGTGCGCCC CCGGTTCCGC CCGAAGCGCC GCAGCGGCAG
CCCGACAACG GGTCCGGGCG GTTCGGCATG CCGGGACGGC CACTGCGGCG CAACAGCTTC
CTCGTCGGCT TCACCGGTGG CCTCGGCGCA CTGCTGGCGT ACGCACTCTT CCTCGGCCTG
CGCAACGCCG CCGGCCTGCT CGTCCTCGTG GTGATCGCGC TCTTTCTCGC CGTCGGGCTC
TACCCGGCGG TGGCGCGGCT ACGCCGGCTC GGGCTGCCCC ACGGACTGGC GGTCGCGGTC
GTCACGCTGA CACTTCTCCT GCTGTTCTGC AGCGGCGTGG TCGCGCTGGT ACCTCCGGTC
GTCACTCAGT CCAACCAGTT CATCGAGCAG TTTCCCAACT ACGTTGAGTC ACTGCGGCGC
AACGAGACGA TCAACGAGTT GGTCGAACGG TACGACCTGA TGGAACGGAT AGAGCGGGCC
GCCGACACCG ACACGCTCGG CCACGCGCTC GGCGGGGTAC TCGGCGGCGC TCAGCTCATC
TTCGGCACCG CATTCCGGAC CCTGACCGTG CTTGTGCTCA CCGTCTATTT CCTGGCGTAC
TTCAACCGGT TGCGGTCGCT CGGGTACGCG CTCGTTCCCC GGTCCCGGCG GGACCGGGTA
CGGCTGATCG GCGATGAGAT CATCATGAAG GTCGGCGCGT ACATCGTCGG GGCGCTCATC
ATCGCCGTCC TCGCCGGCAC GACCACCTTC GTGTTCGCGG TGATCGCCGA GCTACCGTAC
CCGTTCGCCC TGGCCGTCGT GGTGGCGGTG GCCGACCTGA TCCCGCAGAT CGGCGCGACG
CTCGGAGCGG TGATCGTGAG CCTGGTCGGC TTCGCCACCG ACCTGCCGGT GGGGATCGCC
TGCGTGGTGT TCTTCCTCAT CTACCAGCAG TTGGAGAACT ACCTGATCTA CCCCAAGGTG
ATGCGTCGAT CGGTGCAGGT CAACGAGGTG GCTGCGCTGG TCGCGGCGCT GCTCGGCGTC
GCCCTGATCG GCGTGGTGGG CGCCTTCATC GCGATCCCCA CGGTCGCGGC GTTCCAACTG
ATCCTGCGCG AGGTGATCGT CCCGCGTCAG GATTCCCGCT GA
 
Protein sequence
MTQGEPELPE PTGRMNQPPA GQPTGPGPTN APRPSFPGDT AGEGAESGAP PVPPEAPQRQ 
PDNGSGRFGM PGRPLRRNSF LVGFTGGLGA LLAYALFLGL RNAAGLLVLV VIALFLAVGL
YPAVARLRRL GLPHGLAVAV VTLTLLLLFC SGVVALVPPV VTQSNQFIEQ FPNYVESLRR
NETINELVER YDLMERIERA ADTDTLGHAL GGVLGGAQLI FGTAFRTLTV LVLTVYFLAY
FNRLRSLGYA LVPRSRRDRV RLIGDEIIMK VGAYIVGALI IAVLAGTTTF VFAVIAELPY
PFALAVVVAV ADLIPQIGAT LGAVIVSLVG FATDLPVGIA CVVFFLIYQQ LENYLIYPKV
MRRSVQVNEV AALVAALLGV ALIGVVGAFI AIPTVAAFQL ILREVIVPRQ DSR