Gene Sare_4760 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4760 
Symbol 
ID5707477 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5388673 
End bp5389821 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content69% 
IMG OID641274158 
Producthypothetical protein 
Protein accessionYP_001539504 
Protein GI159040251 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.124071 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00173955 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGTGCCT ATCCATTCAC GTTTGTGACC CGTGTGCGTC GTCGGCTCCT GCCGGCGGTC 
GGCGCACTGC TCGCGGTTCT GGTCGTCGTC GGCTGCGGCG AGGCGGGGTC CGACGCGGTC
TGGCAGCCCG GCGTCGGCGG TGACCCAGCC TCCTCCGCCT CGCCCTCCCC CTCCACGGCT
GCGGCGGTAT CCCTCACCGC GACCGGCGAC ATCATCCTGG GTAACGCCCC GGACAGGTTG
CCACCCGACG ACGGCAAGGG CTTCTTCGAC GACGTGCGCC AGGCTCTCGC CGCCGACCTG
GCAATGGGCA ACTTGGAGGA GCCGCTCACC GTCGACACCG GTACCGGCAA GTGCGGGGCC
GGCTCGACCA ACTGCTTCCA GTTCCGGGCG CCACCGGAGT ACGCGGCACA CCTCCGCGAC
GGCGGCTTCG ACCTGCTCAA CCTGGCAAAC AATCATGGGA ACGACTTTGG GGCCAAAGGC
TTCCAGAACA CCCAGGCCGC GCTTGAGCAG CACGAACTGG CACACACCGG CGCGCCGGAC
CAGATCACCG TCGTGGAGGT GCAGGGCGTC CAAGTGGCGG TGGTGGGCTT CTCGTCCTAC
GCGTGGTCGA ACCCGCTGAC CGACATCCCA GCGGCGACGA AGGTCGTCAC CAAGGCGGCC
GAGACGGCGG ACCTGGTGGT CGTGCAGGTG CACATGGGCG CGGAGGGTGC CGACAAGACC
CGGGTCAAAC CCGGCACCGA GTTGTACCTG GGTGAGAACC GGGGTGATCC GATCCGGTTC
GCTAAGGCCA TGGTCGACGC CGGCGCGGAC CTGATCGTCG GGCACGGGCC ACACGTCCTC
CGCGGCATGG AGTTCTACCA GGGCCGGCTG ATCGCGTACA GCCTGGGCAA CTTCGCCGGT
GGCGGCAACA TGCTCAACCG CAGCGGCCGG CTCGGCTGGG GCGGCGTACT CAAGGTCTCG
CTGAAGCCGG ACGGCACCTG GGTCGACGGG TCGTTCGCCT CGACGTACAT GAACGAGTTG
GGTCTGCCGA CGATGGACCC GGACGACCGG GGCCTGGGGC TGGTGCGTGA GCTCAGCGGT
GCGGATTTCC CCAAGACCGG TGCGACCTTC GACGACTCCG GGACGATCAG CCCACCCCGC
GCGGGCTGA
 
Protein sequence
MRAYPFTFVT RVRRRLLPAV GALLAVLVVV GCGEAGSDAV WQPGVGGDPA SSASPSPSTA 
AAVSLTATGD IILGNAPDRL PPDDGKGFFD DVRQALAADL AMGNLEEPLT VDTGTGKCGA
GSTNCFQFRA PPEYAAHLRD GGFDLLNLAN NHGNDFGAKG FQNTQAALEQ HELAHTGAPD
QITVVEVQGV QVAVVGFSSY AWSNPLTDIP AATKVVTKAA ETADLVVVQV HMGAEGADKT
RVKPGTELYL GENRGDPIRF AKAMVDAGAD LIVGHGPHVL RGMEFYQGRL IAYSLGNFAG
GGNMLNRSGR LGWGGVLKVS LKPDGTWVDG SFASTYMNEL GLPTMDPDDR GLGLVRELSG
ADFPKTGATF DDSGTISPPR AG