Gene Sare_4571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4571 
Symbol 
ID5705354 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5189408 
End bp5190496 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content65% 
IMG OID641273982 
Productputative transposase 
Protein accessionYP_001539329 
Protein GI159040076 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.225329 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00149389 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCTGGTC GTCCGCGTGG TGTGGTGGAA CTGACTGATG ACGAGCGTGC GTGTCTGAGC 
CGGTGGGCGC GGCGGGGTAA GTCGTCGCAG GCGTTGGCGT TGCGGTCGAA GATCGTGTTG
TTGTGCGCCG ATGGCCTGGT GAACACGCAT GTCGCGCTGC GGCTTGGGGT GTCGCGGGAC
ATGGTGGGTA AGTGGCGTAG CCGGTTCCTG GCGCGTCGGT TGGAGGGCCT TGTTGACGAG
CCTCGGCCGG GGGCGCCTCG TCGGATCAGC GACGACCGGG TCGAGGAGGT GATCGTGAAG
ACCCTCGAAC GGCAGCCGGC CAATCGGGAC AGTCACTGGT CGACCCGGTC GATGGCGCGC
GAGACCGGGT TGTCACAGAC GGCGGTGTCG CGGATCTGGC GGGCGTTCGG TCTCAAACCG
CATCTGGTGG ACACCTGGAA GTTGTCGGCT GACCCGATGT TCGTGGAGAA AGTCCGTGAC
GTGGTGGGTC TGTACCTGGA TCCGCCGGTC AAGGCGATGG TGCTGTGCGT TGATGAGAAG
TCGCAGATGC AGGCCTTGGA GCGGACCCGC CCGATGCTGC CGATGATGCC CACGGTCCCG
GCGAGGCAGA CCCATGACTA CGTCCGTCAC GGCGTGGCCA GCCTGTTCGC CGCGTTCGAC
CCGGCAACAG GCAAGGTCAT CGGCCAGGTG CACCGCCGGC ACCGCCATCA GGAGTTCCTA
AAGTTCCTGA AGGTCATCGA CGCCAACACC CCCGCCGAGG TGGACCTGCA CCTGGTCCTG
GACAACTACG CCACCCACAA GACCCCAGCC GTGCACCGCT GGCTGGCCGC GCACCCCCGC
TTCCACCTGC ACTTCACCCC GACATCAGCA TCCTGGCTCA ACCTCGTCGA GCGCTGGTTC
GCCGAACTGA CCAACCGCAA ACTCCGCCGG TCCAGCCACC GCAGCCTCAC CGACCTCGAA
ACCGACGTAC AGACCTGGAT CGAGGCATGG AACACCGAAC CGAAACCGTT CGTCTGGACC
AGAACCGCAG ACGAAATCAT GAGCAGCCTC GCCGCATACT GTGGTCGAAT TAACGACTCA
GGACACTAG
 
Protein sequence
MAGRPRGVVE LTDDERACLS RWARRGKSSQ ALALRSKIVL LCADGLVNTH VALRLGVSRD 
MVGKWRSRFL ARRLEGLVDE PRPGAPRRIS DDRVEEVIVK TLERQPANRD SHWSTRSMAR
ETGLSQTAVS RIWRAFGLKP HLVDTWKLSA DPMFVEKVRD VVGLYLDPPV KAMVLCVDEK
SQMQALERTR PMLPMMPTVP ARQTHDYVRH GVASLFAAFD PATGKVIGQV HRRHRHQEFL
KFLKVIDANT PAEVDLHLVL DNYATHKTPA VHRWLAAHPR FHLHFTPTSA SWLNLVERWF
AELTNRKLRR SSHRSLTDLE TDVQTWIEAW NTEPKPFVWT RTADEIMSSL AAYCGRINDS
GH