Gene Sare_4391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4391 
Symbol 
ID5706099 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4963214 
End bp4964608 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content64% 
IMG OID641273810 
Producthypothetical protein 
Protein accessionYP_001539160 
Protein GI159039907 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0036434 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGACGGGGA AGGACACCAA CACCGCGATG AACGATCAGC GTTATGACCA CGAACCGGAC 
CGGCCAGGTG CGCGCCGGGC GAGGTCACGG TGGTGGGCGG TTGGACTGGC CGGCATGACC
GGCCTGGCCC TCACCACCAG CCTCGGCATC GGCGGCGCGC CGGCTATCGG CGCCGTCGAC
GGCATTCTCA CTGCCGCCGA TGACCGGCGG GGAAAACCTG ACCGCGCCTC CACCGACAAC
AAGGACCACA AGGGCAAGGG AACAAAGGAC AAACGCAAGG GCACACCGGT CCCGTGTGAC
GCCGATGCGC TGATCGCGGC GATCACCCTC GCCAACGCCC GCGGCGGCGC CGTCCTCGAC
CTCGCCAAGG ACTGCACCTA CCTACTCGCC GCCACCATCG ACGGCGCCGG CCTGCCCGCC
ATCACCACCC CGATCACCCT CAACGGCGGC AAACACACCA CCATCACACG CGCCGCCGCC
GCCGCTGAAC CGTTCAGAAT CCTCACCGTC GACATGGGCG GCGACCTCGC CCTCAACAAG
CTGAAAATCA CCGGCGGACA GACCGCTGCC GACGAGGACG GAGGAGGGAT CCTGGTCAAC
ACTGGCGGAA ACTTGATCAT CGACCATAGT GCCATCACCC GTAACATCGC CAGCGACAAC
GGCGGCGGGA TCGCCAACAA CGGCACTGCC ATTGTCAGAA ACTCCACCGT CAGCCATAAC
ACCGCTGGTC TAGCTGGCGG AGGTGTCGTC AGCGTCGGCG TACTCGACCT TAAGGCGTCC
CATGTGTCCG CCAATGCCGC CCTTGCCGGA GTGGCTGGGG TGTTCAGCGG GGGCACAGCC
CGGATCAAGC GCAGTACCAT TACCGCCAAC CATGCCCAGG TGGGAAATGT TGGCGGCCTT
GGTGTCTTTG GAACCGGTAA CGTTTCGGAT ACCCGGATCA CGGACAATAC CGCCCCGGAG
GGTGGCGGCG TCGTTGTGGG CGTCGGTGGA CAACTCACAC TCAGATCCGT AACCATCACC
GGAAACACGG CTAGGACGGG TCGTTCTGGT GGCCTAGGGA TTGACCCAAA CGCATCCGCT
GTTGTCGAGG ACAGCATCAT CAAAAACAAT GCCGCCATCG ACGGCGGCGG TATCTACAGC
TTCGCCGAGG CGGTGTTACG GCGCACGGTG CTTACCGGTA ACCAGGCAGG CAACGAGGGT
GGCGGCATCT ACAACCTTTC CAGCGGCGAG ATCAACCTCT TTTCAACAAA GATCATGAAA
AACGTTGCCG TCACGGACGG CGGCGGGATC TTCAACGAGG TGGGTGGCAC GGTCGAGTTG
AACACCGCCA CCGGCACCAC CGTGGTCAAG AACCGGCCGA ACAACTGCGT CAACGTCACC
GGCTGCCCGG ACTGA
 
Protein sequence
MTGKDTNTAM NDQRYDHEPD RPGARRARSR WWAVGLAGMT GLALTTSLGI GGAPAIGAVD 
GILTAADDRR GKPDRASTDN KDHKGKGTKD KRKGTPVPCD ADALIAAITL ANARGGAVLD
LAKDCTYLLA ATIDGAGLPA ITTPITLNGG KHTTITRAAA AAEPFRILTV DMGGDLALNK
LKITGGQTAA DEDGGGILVN TGGNLIIDHS AITRNIASDN GGGIANNGTA IVRNSTVSHN
TAGLAGGGVV SVGVLDLKAS HVSANAALAG VAGVFSGGTA RIKRSTITAN HAQVGNVGGL
GVFGTGNVSD TRITDNTAPE GGGVVVGVGG QLTLRSVTIT GNTARTGRSG GLGIDPNASA
VVEDSIIKNN AAIDGGGIYS FAEAVLRRTV LTGNQAGNEG GGIYNLSSGE INLFSTKIMK
NVAVTDGGGI FNEVGGTVEL NTATGTTVVK NRPNNCVNVT GCPD