Gene Sare_4599 details


Gene Information

Locus tag: Sare_4599
Symbol:
ID: 5706620
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5218905
End bp: 5220341
Gene Length: 1437 bp
Protein Length: 478 aa
Translation table: 11
GC content: 65%
IMG OID: 641274003
Product: parallel beta-helix repeat-containing protein
Protein accession: YP_001539350
Protein GI: 159040097
COG category:
COG ID:
TIGRFAM ID: [TIGR01376] Chlamydial polymorphic outer membrane protein repeat
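
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon; both assumptions are consistent with the values listed in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5218905, 5220341)
print(length)                  # 1437
print(protein_length(length))  # 478
```

Both results agree with the Gene Length (1437 bp) and Protein Length (478 aa) fields.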


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.119679
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTGTC AGAATCACAC ACACAAGCTG GATCGCGCTG GTACGGGCCG GGCACGGTCA 
CGGCGACGGT GGTCGGCTGC CGGTCTGGCC GGTGCGATCG TTCTGGCTCT CACCACCGCT
GGGATGGCCG CCGCCTCGAC CGCCGACGCT GTCGGGCGTA CCGTCACCAC CACACAGCTC
GGCGTTGACC AGCCCGGGAA ACCCATCCGC GACGAGCCCC CCGCAGGAGA CCACCGCGAC
GATCGCGGCA ACCCCGACGA CAGCAAGCAT GCCGGGGCTA AGGGGCAGAA GGAGGGTCGA
AGGGGTGCGC CGGTCCCCTG CGACCCGGAC CGGTTGATCG CCGCGATCAA ACTGGCCAAC
GCCCGCGACG GCGCTGTTCT CGACCTCGCC GATGACTGCA CCTACCTGCT CACCGCCAAC
ATCGACGGCG CCGGCCTCCC CGCAATCACC AGCCCGGTCA CCCTCAACGG CGGTGATGGC
ACCACCATTG AACGCGCGGC CGCCGCCGAC CAGTTCAGGA TCCTCACCGT CAACGCCGGC
GGTGACCTCA CCCTCAACCA CCTCAAAGTC ACCGGCGGCC AAACCCCTGT AGGCGCCGAT
GGGGGCGGCA TCCTCGTCAA CGCCGGCGGT CGGTTGACCG CGAGGCACAG CAAGATTCTC
GGCAACATCG GTGGTGGGGG TGGTGGCGGC AGCGGTGGCG GGATCGCCAA CCTTGGCGTC
ACCGCGATCA AGAATTCCAC CGTCAACCAG AACACCGCCG GGTTTCTGGG CGGCGGCATC
TACAACACAG GTCTACTCAC CATCAGAAAG TCCCACGTCG AGGCGAACAA CGGTCCCTTC
GGAGGAGGTG GTGTCACGAA CAGCGGAGGA ACAATGCGGG TCACGCACAG CGCCATCTCC
GGCAACCGGT CCATCCAGGG TGCCGGCCTG TTCGTCGTCG ACGGCGGAAA CGGCGCCGTC
AGTGACACCC GCATCACTGA CAACACCGCG ACAATCATCG GCGGAGGTGT CTACCTGAGC
GGCCAACTGA CCATGCGGCG GGCCGTCCTT GCTGCCAACA CCGCGGTTGG GGGTGGTGGC
GGTGGCCTGT TCGTCAGTGC AGGATCCACA GCGACCATCG AGGGAAGTGT TGTCAAGGAC
AACACCGCCA CCGGTGCCAC CGGTTTCGGC GGGGGCATCT TCAACAACGC CGAGACAACG
GTTCGTGGCA CGAAAGTGAT CGGTAACGTC GCTGACCAGG GGGGTGGTCT CCACAACGAT
TCGAGCGGAG TACTCACTCT CGTCGCCGTC ACGGTCACCG ACAATATCGC AATAACCGAT
GGTGGGGGAG TCTTTAACGC AGCCGGCGGT GTGGTCGCGC TGAACACAGC CACCGGGACA
ACTGTTATTG GCAATCAGCC GAACAACTGC GTCAACGTTC CCGGCTGTGC CGGATAA
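
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be cleaned before any computation such as GC content. The following is a minimal sketch of that step; the helper names are hypothetical, and the example runs on the first line of the sequence only, so its result is not expected to match the whole-gene GC content field.

```python
def clean_sequence(text: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence as displayed in this record.
first_line = "ATGACCTGTC AGAATCACAC ACACAAGCTG GATCGCGCTG GTACGGGCCG GGCACGGTCA"
seq = clean_sequence(first_line)
print(len(seq))          # 60
print(gc_fraction(seq))  # 0.6
```

Passing the full multi-line sequence text to `clean_sequence` works the same way, since `str.split()` discards all whitespace.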
 
Protein sequence
MTCQNHTHKL DRAGTGRARS RRRWSAAGLA GAIVLALTTA GMAAASTADA VGRTVTTTQL 
GVDQPGKPIR DEPPAGDHRD DRGNPDDSKH AGAKGQKEGR RGAPVPCDPD RLIAAIKLAN
ARDGAVLDLA DDCTYLLTAN IDGAGLPAIT SPVTLNGGDG TTIERAAAAD QFRILTVNAG
GDLTLNHLKV TGGQTPVGAD GGGILVNAGG RLTARHSKIL GNIGGGGGGG SGGGIANLGV
TAIKNSTVNQ NTAGFLGGGI YNTGLLTIRK SHVEANNGPF GGGGVTNSGG TMRVTHSAIS
GNRSIQGAGL FVVDGGNGAV SDTRITDNTA TIIGGGVYLS GQLTMRRAVL AANTAVGGGG
GGLFVSAGST ATIEGSVVKD NTATGATGFG GGIFNNAETT VRGTKVIGNV ADQGGGLHND
SSGVLTLVAV TVTDNIAITD GGGVFNAAGG VVALNTATGT TVIGNQPNNC VNVPGCAG
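
The protein sequence corresponds to a translation of the gene sequence under translation table 11. As a rough sketch, the codon table can be built from the NCBI-style amino-acid string, which for table 11 assigns the same amino acids as the standard code (the two tables differ in their permitted start codons, which does not matter here because this gene begins with ATG). The `translate` function is an illustrative helper, not part of any library; it stops at the first stop codon and raises `KeyError` on codons containing characters other than A, C, G, T.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGACCTGTC AGAATCACAC ACACAAGCTG"))  # MTCQNHTHKL
```

The output matches the first ten residues of the protein sequence shown above.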