Gene Sare_3559 details


Gene Information

Locus tag: Sare_3559
Symbol:
ID: 5705052
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4107123
End bp: 4108946
Gene Length: 1824 bp
Protein Length: 607 aa
Translation table: 11
GC content: 71%
IMG OID: 641272986
Product: hypothetical protein
Protein accession: YP_001538352
Protein GI: 159039099
COG category:
COG ID:
TIGRFAM ID:
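
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The sketch below is a minimal consistency check, assuming the coordinates are 1-based and inclusive and that the gene length counts the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
# Hypothetical helpers for sanity-checking a gene record.
# Assumption: coordinates are 1-based and inclusive at both ends.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length in base pairs of an inclusive coordinate interval."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon is part of the gene but does not encode an amino acid.
    return codons - 1 if includes_stop else codons


# Values copied from the record above.
length = gene_length_bp(4107123, 4108946)
print(length)                     # 1824
print(protein_length_aa(length))  # 607
```

Both results agree with the Gene Length (1824 bp) and Protein Length (607 aa) fields of the record.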


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0477684
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCGACCT CGCCGTCCGT GGTGTCCACG CCGACGCCAC CCTCCCGCGT TCACCGCAAC 
CGCCGCGCGG ATCTCATCGT GACGCTCGTC GCGCTCGCCC TCGCGGTGTG GGTGACCAGC
GGGCTGTGGC GGGACCCGAA CGGTCGCACG ATCACCGTCA ACTCCAGCGA CCAGGCACTG
TTCGAGTGGC TGCTGGCGTT CGGCGGGCAT GCCCTCACCC ACGGGCAGAA CCCGCTGTTC
ACACACATGA TCAACTTCCC AGACGGGGTG AACCTGGCAG TCAACACGTC GATCACTGTG
TACGCGGCGG TGTTCGCGCC GCTGACATAC CTGATCGGGC CACCGGCCAC CTTCCTGGTG
ATCCTCACGC TCAACCTCAC CGCCACCGCG GTCGCCTGGT ACTGGCTACT CAACCGCCAG
TTCACCCGCA GCCCGCTCGC CGCCGGGCTG GGTGGGCTGT TCATCGGCTT CTCCCCCGGC
ATGATCTCGC ACGCCAACGC CCACCTGAAC TGGACCGCCG GCTGGTTGGT ACCGCTGCTG
ATCTGGCGCG TCCTCGCGCT GCGCCGCGCC GGACACCGGG TCCGCGACGG TGTCATCCTC
GGCGTGCTGG TCGCGGTGGC GTTCTCCATC GCTGCCGAAG GGCTCTTCTT CACCGCGCTC
GCCCTCGGCC TCTTCGTCGT CGTCTGGGCA CTGCACCCCG CCAGCCGCGC CGAAGCGCGT
GCGGCGCTGC CCGGCTTCCT CGGCGGACTC GCGGTGACCG CGCTGGTGGC CGGGGTGCTC
CTGGCGTACC CGCTGTGGCT GCACTTCGCC GGCCCGCAAC GCTTCGACGG CACCGGCTTC
GACGCGGTGA CCCACTCCGA GGACATCGCC TCGTACGTCG CCTACCCCCG ACGCTCGCTC
GCCGGCGAGG CCGGGCTACG AACCAACCTC GCCCCGAACC CGACCGAGGA GAACTCGTTC
TTCGGGCTGC CGCTGCTGCT CCTCGCCGTC GCCGCCTTCG TCCTGCTGTG GCGGCGCGTC
CAACAGGCGC CCCAACGGGC CACGCTCTGG GCGCTCGGTG CGGTGGCGGC GGCCTTCACC
GTGTTGTCGT TCGGCCCCCA GGCCAAGGTC GACGGCCGCC GCTTCGATCT GCCGATGCCC
TTCGACGTGC TGGCTCACCT GCCGGTGGTG AACGCCGCGC TGCCCGCCCG GTTCGCGCTG
GTCGTGGCGC CGGTCATCGG CGTGTTGCTC GCGTACACGG TGGATGCGCT GCGGGCCGAA
CCGCCTCGGT CCCGGCCGGC CCGGATCGCC TGGCTGGTGG CGTTCGCCGT GGCGCTGGCG
CCGCTGATAC CTACCCCACT GCTCACCATC GAGCGGGAGC CGATCCCACG CTTCTTCACC
TCTGGGGCCT GGCAGGAGTA CGTCTCACCG GGCGGGATCC TCACACCGGT GCCGCTGGCC
GTGGACGTGT ACCCGGACGG ACAGCGGTGG CAGGCGTACG TCCTCGCGAA CCGGCAGGGC
GAGTTCCGGA TCCCGTCCGG GTTCTTCCTC GGTCCCGGTG GTCCGGACGG GCGCGGCCGC
ATCGGCCCGG TTCCTCGACC GATGAGCGCC ATCTTCGATC AGGCCGCCCG CCAGAGTGTG
GTGCCGATCG TCACCGAGGG GACTCGCCAG GACGTGCAGG CCGACCTGCG GCACTGGCAG
ATCGAGACGG TGGTGCTCCC GGACCAGGTG CACGGGGCGA AGTGGGACGT GGACGAGGAG
GCGGTCCGTC GAACCGCCAC CGAGCTGTTC GGCGAACCGG AACGCGTCGA GGATGTCTGG
GTCTGGCGGA TCCCACCGGG CTGA
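
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any computation such as the GC content reported in the record. The following is a minimal sketch of that step, assuming the sequence contains only A, C, G and T; the helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
# Hypothetical helpers: join a block-formatted sequence and measure GC content.

def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence above, used here only as sample input.
first_line = "GTGCCGACCT CGCCGTCCGT GGTGTCCACG CCGACGCCAC CCTCCCGCGT TCACCGCAAC"
seq = clean_sequence(first_line)
print(len(seq))                      # 60
print(f"{gc_fraction(seq):.0%}")     # GC content of this one line only
```

Passing the full 1824 bp sequence to `clean_sequence` and `gc_fraction` would give the value to compare against the GC content field.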
 
Protein sequence
MPTSPSVVST PTPPSRVHRN RRADLIVTLV ALALAVWVTS GLWRDPNGRT ITVNSSDQAL 
FEWLLAFGGH ALTHGQNPLF THMINFPDGV NLAVNTSITV YAAVFAPLTY LIGPPATFLV
ILTLNLTATA VAWYWLLNRQ FTRSPLAAGL GGLFIGFSPG MISHANAHLN WTAGWLVPLL
IWRVLALRRA GHRVRDGVIL GVLVAVAFSI AAEGLFFTAL ALGLFVVVWA LHPASRAEAR
AALPGFLGGL AVTALVAGVL LAYPLWLHFA GPQRFDGTGF DAVTHSEDIA SYVAYPRRSL
AGEAGLRTNL APNPTEENSF FGLPLLLLAV AAFVLLWRRV QQAPQRATLW ALGAVAAAFT
VLSFGPQAKV DGRRFDLPMP FDVLAHLPVV NAALPARFAL VVAPVIGVLL AYTVDALRAE
PPRSRPARIA WLVAFAVALA PLIPTPLLTI EREPIPRFFT SGAWQEYVSP GGILTPVPLA
VDVYPDGQRW QAYVLANRQG EFRIPSGFFL GPGGPDGRGR IGPVPRPMSA IFDQAARQSV
VPIVTEGTRQ DVQADLRHWQ IETVVLPDQV HGAKWDVDEE AVRRTATELF GEPERVEDVW
VWRIPPG
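
The protein sequence can be derived from the gene sequence with translation table 11, which uses the same codon-to-amino-acid assignments as the standard code but allows alternative start codons such as GTG. The sketch below is a simplified illustration, not the database's own pipeline: the function name is hypothetical, and it assumes that a first codon of ATG, GTG or TTG is read as methionine (table 11 lists further start codons that are omitted here).

```python
# Simplified translation sketch for bacterial CDS (translation table 11).
# Assumption: only ATG, GTG and TTG are treated as start codons here.

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO_ACIDS
    )
}
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(seq: str, start_as_met: bool = True) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        if i == 0 and start_as_met and codon in START_CODONS:
            amino_acid = "M"  # an initiator codon is read as methionine
        protein.append(amino_acid)
    return "".join(protein)


# First 18 bases and last 24 bases of the gene sequence above.
print(translate("GTGCCGACCT CGCCGTCC"))          # MPTSPS
print(translate("GTCTGGCGGA TCCCACCGGG CTGA"))   # VWRIPPG
```

The two fragments reproduce the first six and last seven residues of the protein sequence listed above.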