Gene Sare_3556 details

Gene Information

Locus tag: Sare_3556
Symbol:
ID: 5705049
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4103687
End bp: 4104808
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 68%
IMG OID: 641272983
Product: hypothetical protein
Protein accession: YP_001538349
Protein GI: 159039096
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation)
TIGRFAM ID:
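
The coordinate and length fields above agree with each other, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the protein length does not count the terminal stop codon; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4103687, 4104808)
print(length)                  # 1122
print(protein_length(length))  # 373
```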


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0984898
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0100122
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTCAAA AAAATCGGGG GATCGTCGTC GCCGCCACCG GTGACCTGCT CATCGCGCGC 
GATCGTCCGC ACGACATGTT TCGGCACGTG CGGGACCTGT TGACCGAGGC CGACATCACC
GTCGGCCAGT TGGAGACGCC GTACTCCGAC CAGGGCTCCC GGAGTTCATG CGGGGCGCGC
GGCGCGGTCC CGAACGATGT GGCGAACTTC GCGGCGATCC CGCACGCGGG CTTCGACGTC
ATCTCCCTGG CGAGCAACCA CGCCGGCGAC TGGGGGGCCG ACGCGTTACT CGACTGCATC
GAGCGGTGCC GGCGCCACGG CATCACCGTG GTGGGTGCGG GGGCGGACAT CGCCGAGGCG
CGCCGGCCGG GGATCATCGA ACGCGATGGG ACCCGGGTCG GATTCCTGGC CTACTGCTCG
GTCGCGCCGG ATGGCTACTA CGCCGGGCCG GGTAATCATG GTGTGGCGCC GATGCGGGCG
AGAACACTCT ACGAACCGTT CCAGTTCGAC CAGCCCGGAG CTCCGCCCTC GGTCAGAACC
CTGCCGGACG AATACGATCT GGCGGCGCTC GTCGCGAACA TCGGCGAGTT GCGCGACCAG
GTGGATGTGC TGATCGTGTC GCTGCACTGG GGCCTGCTCT TTCAGCGCTC ACGGCTCGCG
GACTACCAGC CGGTGGTGGC GCACGCGGCG ATCGACGCCG GCGCCGACGT GGTGATCGGG
CACCACCCGC ACATCCTGAA GCCGGTGGAG GTCTACCGAG GCAAGGTCAT CTTCTACAGC
CTGGGCGACT TCGCCCTCGA GATCAACGAG CGCTGGTGGC GGTCGTTCAG CCGGGAGTGG
TTCGAGCGGG CGGTCCAGTT CTATCAGGCA CTCGCCCCCG GCCAGGATAT GCACGAGGAG
GGCCGGAACT CGATGATCGT CCAGCTGCAC ATCGTCGACG GCCGCATCGA CCGGGTTGGC
TTCGTACCCG TGACGATCAA CGATGCACGC GAGCCGGTGC CGTACCGGGC GGACACAGAG
GACGGGCGCG CGGTCCGCGC CTACCTGGCG CAGATCACGG CCGAGGCGGG GATCGACACC
ACCTTCGACG TGGTCGACGA CGAGGTCCTG GTCCGTATCT GA
 
Protein sequence
MGQKNRGIVV AATGDLLIAR DRPHDMFRHV RDLLTEADIT VGQLETPYSD QGSRSSCGAR 
GAVPNDVANF AAIPHAGFDV ISLASNHAGD WGADALLDCI ERCRRHGITV VGAGADIAEA
RRPGIIERDG TRVGFLAYCS VAPDGYYAGP GNHGVAPMRA RTLYEPFQFD QPGAPPSVRT
LPDEYDLAAL VANIGELRDQ VDVLIVSLHW GLLFQRSRLA DYQPVVAHAA IDAGADVVIG
HHPHILKPVE VYRGKVIFYS LGDFALEINE RWWRSFSREW FERAVQFYQA LAPGQDMHEE
GRNSMIVQLH IVDGRIDRVG FVPVTINDAR EPVPYRADTE DGRAVRAYLA QITAEAGIDT
TFDVVDDEVL VRI
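
The sequences above are displayed in blocks of ten characters, so the spaces and line breaks must be removed before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the protein sequence. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons; the helper names (`clean`, `gc_content`, `translate`) are illustrative, and alternative start codons are not handled because this gene begins with ATG.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order. Translation table 11 shares
# these assignments with the standard code; only the start codons differ.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove display spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First displayed row of the gene sequence, copied from the record.
first_row = "ATGGGTCAAA AAAATCGGGG GATCGTCGTC GCCGCCACCG GTGACCTGCT CATCGCGCGC"
print(translate(first_row))  # MGQKNRGIVVAATGDLLIAR
```

Passing the full gene sequence to `gc_content` and `translate` would allow the reported GC content and the protein sequence to be checked in the same way.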