Gene Sare_4774 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4774 
Symbol 
ID5704441 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5402649 
End bp5404070 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content67% 
IMG OID641274172 
Producthypothetical protein 
Protein accessionYP_001539518 
Protein GI159040265 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0272932 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000356618 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGACCGGCC TGGCTCTCGC CACCATAGGT GTCGCCGCCC CGGCCGCGGA CGCGGTCGGA 
CGCCCGCTCA CCTCCGCCGA CGAGCAGCCC TCCGCCGACG AGCGCCGCAG CGGGGGTGAG
CCCGGTAAGG GCGAGTCCGG CAAAGGTGAG TCCGGTAAGG GCGAGCCCGG CAAGGCCGAG
TCCGGCAAGG GTGACCCTGG CAGAGGCGAG TCCGGCAGGG GCGAGTCCGA TCAGGGCAAG
GGCAAGAAGG AGCCGAAGCC GAAGGGCGTT CCGGTCCCCT GTGACGCGGA CAAGCTGATC
GCCGCGATCA CCCTGGCCAA CGCCCGCGGC GGCGCCGTGC TCGACCTCGC CAAGAAGTGC
ACCTACCTGC TCACCGCCAC CATCGACGAC GGCGCCGGCC TGCCGGTTGT CACCGCCCCC
ATCACCCTCA ACGGCGGCAA ACACACCACC ATCAAACGCG CCGCCGGGGT GGAGGAGTTC
CGCATCGTCA CCGTCGGCAC CGGCGGTGAC CTCACCCTCA ACCACCTGAC AATCACGGGT
GGACAGACTG ACGGCGATGG CGGAGGAATC CTGGTCAACG CCGGCGGAGC GTTGACCACC
AACCACAGCA CCGTCACCCG CAACATCGCT GGCAGCGACG GCGGCGGAAG CAGCGGCGGT
ATCGCCAACA ACGGCACCAC CACCATCAAA CACTCCGCCG TCAGCCGCAA CACTGCGGCA
ACCGCCGCTG GAGGCATCGG AAACACCGGT CAACTCGCCA TCAAGAAATC CTCCGTCACC
GCGAACATGG CCAACGCCGT CGTGGGCGGG TTCGGTGGAG GTGTCGGTAG CTTCCCCGGC
GGCACCACGG TCGTGACTGG CAGCACCATC AGCGGTAACC ACGCCGGCGA CGCTGGTGGG
GGTGCCGGCG GCTTCAACGC GAACGTCACC GTCACCGACA CCGCCATCAC CGGTAACAGA
GCCAGCAACG GCGGCGCGGT CTTCGCGGAG GGGGGCATGC TGGCCCTACG CCACGTCACA
GTCACCAACA ACACCGCCAC CCTTCAGGGC GGCGGCCTCA GCCTCCAAGC CCTCAACGCG
GCGACCGTGG CAACCGTCGC GGACAGCACA ATCGCGCACA ACGTCGGCAG TCTGAACGGT
GGAGGTATCG TCAACGCCGC GATCGCCTTC GCCTCCACAC TCGACGTGCG GAACACCCAC
ATCACGGCCA ATCAGGCAAC ATTTGGTGGC GGAATCTTCA ACATCGCCGT TGACGCCACG
GTCACGCTCA CCAACACGAA GGTCATCAAG AACATCGCCA TCAGTACTGG CGGGGGCATC
CTCAACTCGG GCGGAACGGT GAACCTGAAC ACGGCCACCG GTACCGTCGT GGTCAAGAAC
CGGCCAAACA ACTGCGTCAA CGTGCCCGGC TGCGTCGGAT AG
 
Protein sequence
MTGLALATIG VAAPAADAVG RPLTSADEQP SADERRSGGE PGKGESGKGE SGKGEPGKAE 
SGKGDPGRGE SGRGESDQGK GKKEPKPKGV PVPCDADKLI AAITLANARG GAVLDLAKKC
TYLLTATIDD GAGLPVVTAP ITLNGGKHTT IKRAAGVEEF RIVTVGTGGD LTLNHLTITG
GQTDGDGGGI LVNAGGALTT NHSTVTRNIA GSDGGGSSGG IANNGTTTIK HSAVSRNTAA
TAAGGIGNTG QLAIKKSSVT ANMANAVVGG FGGGVGSFPG GTTVVTGSTI SGNHAGDAGG
GAGGFNANVT VTDTAITGNR ASNGGAVFAE GGMLALRHVT VTNNTATLQG GGLSLQALNA
ATVATVADST IAHNVGSLNG GGIVNAAIAF ASTLDVRNTH ITANQATFGG GIFNIAVDAT
VTLTNTKVIK NIAISTGGGI LNSGGTVNLN TATGTVVVKN RPNNCVNVPG CVG