Gene Sare_3107 details


Gene Information

Locus tag: Sare_3107
Symbol:
ID: 5706546
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 3531373
End bp: 3532461
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 65%
IMG OID: 641272540
Product: putative transposase
Protein accession: YP_001537908
Protein GI: 159038655
COG category:
COG ID:
TIGRFAM ID:
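
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only and not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields.
start_bp = 3531373
end_bp = 3532461
gene_length_bp = 1089
protein_length_aa = 362

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
total_codons = span_bp // 3
coding_codons = total_codons - 1

print(span_bp == gene_length_bp)           # coordinates agree with Gene Length
print(span_bp % 3 == 0)                    # length is a whole number of codons
print(coding_codons == protein_length_aa)  # codons minus stop agree with Protein Length
```

Under these assumptions all three checks hold for this record: 3532461 - 3531373 + 1 = 1089 bp, which is 363 codons, or 362 residues plus one stop codon.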


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0260258
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGGTC GTCCGCGTGG TGTGGTGGAA CTGACTGATG ACGAGCGTGC GTGTCTGAGC 
CGGTGGGCGC GGCGGGGTAA GTCGTCGCAG GCGTTGGCGT TGCGGTCGAA GATCGTGTTG
TTGTGCGCCG ATGGCCTGGT GAACACGCAT GTCGCGCTGC GGCTTGGGGT GTCGCGGGAC
ATGGTGGGTA AGTGGCGTAG CCGGTTCCTG GCGCGTCGGT TGGAGGGCCT TGTTGACGAG
CCTCGGCCGG GGGCGCCTCG TCGGATCAGC GACGACCGGG TCGAGGAGGT GATCGTGAAG
ACCCTCGAAC GGCAGCCGGC CAATCGGGAC AGTCACTGGT CGACCCGGTC GATGGCGCGC
GAGACCGGGT TGTCACAGAC GGCGGTGTCG CGGATCTGGC GGGCGTTCGG TCTCAAACCG
CATCTGGTGG ACACCTGGAA GTTGTCGGCT GACCCGATGT TCGTGGAGAA AGTCCGTGAC
GTGGTGGGTC TGTACCTGGA TCCGCCGGTC AAGGCGATGG TGCTGTGCGT TGATGAGAAG
TCGCAGATGC AGGCCTTGGA GCGGACCCGC CCGATGCTGC CGATGATGCC CACGGTCCCG
GCGAGGCAGA CCCATGACTA CGTCCGTCAC GGCGTGGCCA GCCTGTTCGC CGCGTTCGAC
CCGGCAACAG GCAAGGTCAT CGGCCAGGTG CACCGCCGGC ACCGCCATCA GGAGTTCCTA
AAGTTCCTGA AGGTCATCGA CGCCAACACC CCCGCCGAGG TGGACCTGCA CCTGGTCCTG
GACAACTACG CCACCCACAA GACCCCAGCC GTGCACCGCT GGCTGGCCGC GCACCCCCGC
TTCCACCTGC ACTTCACCCC GACATCAGCA TCCTGGCTCA ACCTCGTCGA GCGCTGGTTC
GCCGAACTGA CCAACCGCAA ACTCCGCCGG TCCAGCCACC GCAGCCTCAC CGACCTCGAA
ACCGACGTAC AGACCTGGAT CGAGGCATGG AACACCGAAC CGAAACCGTT CGTCTGGACC
AGAACCGCAG ACGAAATCAT GAGCAGCCTC GCCGCATACT GTGGTCGAAT TAACGACTCA
GGACACTAG
 
Protein sequence
MAGRPRGVVE LTDDERACLS RWARRGKSSQ ALALRSKIVL LCADGLVNTH VALRLGVSRD 
MVGKWRSRFL ARRLEGLVDE PRPGAPRRIS DDRVEEVIVK TLERQPANRD SHWSTRSMAR
ETGLSQTAVS RIWRAFGLKP HLVDTWKLSA DPMFVEKVRD VVGLYLDPPV KAMVLCVDEK
SQMQALERTR PMLPMMPTVP ARQTHDYVRH GVASLFAAFD PATGKVIGQV HRRHRHQEFL
KFLKVIDANT PAEVDLHLVL DNYATHKTPA VHRWLAAHPR FHLHFTPTSA SWLNLVERWF
AELTNRKLRR SSHRSLTDLE TDVQTWIEAW NTEPKPFVWT RTADEIMSSL AAYCGRINDS
GH
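
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is a minimal illustration, not the method used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons), and it assumes the sequence is given in the coding orientation starting with ATG. The helper name `translate` and the constant names are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments in the conventional TCAG ordering.
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (the 10-base grouping used in the listing) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, copied from the listing.
first_bases = "ATGGCTGGTC GTCCGCGTGG TGTGGTGGAA"
print(translate(first_bases))  # MAGRPRGVVE

# Last 9 bases of the gene sequence: two residues followed by a TAG stop.
print(translate("GGACACTAG"))  # GH
```

Both results match the start (MAGRPRGVVE) and end (GH) of the protein sequence listed above.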