Gene Sare_4397 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4397 
Symbol 
ID5703445 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4968237 
End bp4969601 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content62% 
IMG OID641273815 
Producthypothetical protein 
Protein accessionYP_001539165 
Protein GI159039912 
COG category 
COG ID 
TIGRFAM ID[TIGR01376] Chlamydial polymorphic outer membrane protein repeat 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.792915 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0309659 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGATC AGCGCCACGA CCACGAACCG GACCGACCAG GTGCGCGCCG GGCGAGGTCA 
CGATGGTGGA ACGTCGGACT GGCCGGCATG ACCGGCCTGG CCCTGACCAC CAGCCTCGGC
ATCGGCGGAG CGCCGGCTAC CGGCACCGTC GACCGCACTC TCACCGCCAC CGATGACCGG
CCGGGAAAAC CCGACCATGC CTCCACCGAC AACAAAGATC ACGAGGGCAA GGGAAAGAAG
GGCACACGCA AGGGCACACC GGTCCCGTGT AGCGCCGACG CGTTGATCGC CGCGATCACT
TTGGGCAACG CCCGCGGTGG CGCCGTGCTC GACCTTGCCA GTAAGTGCAC CTATCTGCTT
ACCGCCGACA TCGACGACGG CGCCGGCCTT CCTGAGATCA CCGCCCCGAT CACCCTCAAC
GGTGGCACAC ACACCAGCAT CACACGCGCC GCCGCAGCGC CCCTGTTCAG AATTCTCACC
GTTGAGGCCG GCGGAGACCT CACCCTCAAC CACCTCACCA TCACCGGCGG ACAGACCACC
GACAACGGCG GAGGAATCCT TGTCAACACC GGCGGAACAC TGGCCACCAA GCACAGCACC
ATCATCCGCA ACATTGCCGG AGGCTCAGGT GGCGGCATTT TGAGTTTCGG GAACACATCC
ATCGACTCAT CCACTATATC CTATAATTCT GCGATTATAG ACGGTGGCGG TATCAACAGT
CAACGGCAAA TATATTTGAC AAATTCTTTC ATTGTCAATA ATCAGACTTC TATCAACAGC
GGCGGCGGTG CCGCGATTGT GGGATCCGCG GCCATCGCGA AGACCGCGAT CACGGGCAAC
GGCGCCGGGA ATGCCGGCGG CGGTCTCGTC ATTTTTGCAG CAGCGGCTGC GATTACTGAT
TCCAAAATCG TCGACAATGC CAGCGCTAAC AATGGTGGGG GAATTTCCGT TCCTGATGGC
GCTCAGCTTA CGTTGCGGCG GGTAGCCGTC GCGGACAATC GGGCCGGCGG CACCGGCGGC
GGGCTTTTCG TAAACCTCAA CAATCAGGTC GTGGTCGAAG ACGCCAGGTT CGAGAGGAAC
ACCGCAACAG GAGACGGTGG CGGAATCAGC AACCTCAGCG ATACGGTGGT GCGGCAGTCC
AAACTCTCCG ATAACCAGGC CGACGAGGGC GCCGGCGTAT TCAACGGTGC CACCGCTACG
CTTACCCTCT ACTCCACTAA AGTCGTCAAG AATACGGCCG TCACTGACGG CGGTGGGATC
TTCAACGAGG CGGGCGGCAC GGTCGAGTTG AACACCGCCA CCGGCACCAC CGTGGCAAAG
AACCGGCCGA ACAACTGCGT CAACGTCACC GGCTGCCCAG ACTGA
 
Protein sequence
MNDQRHDHEP DRPGARRARS RWWNVGLAGM TGLALTTSLG IGGAPATGTV DRTLTATDDR 
PGKPDHASTD NKDHEGKGKK GTRKGTPVPC SADALIAAIT LGNARGGAVL DLASKCTYLL
TADIDDGAGL PEITAPITLN GGTHTSITRA AAAPLFRILT VEAGGDLTLN HLTITGGQTT
DNGGGILVNT GGTLATKHST IIRNIAGGSG GGILSFGNTS IDSSTISYNS AIIDGGGINS
QRQIYLTNSF IVNNQTSINS GGGAAIVGSA AIAKTAITGN GAGNAGGGLV IFAAAAAITD
SKIVDNASAN NGGGISVPDG AQLTLRRVAV ADNRAGGTGG GLFVNLNNQV VVEDARFERN
TATGDGGGIS NLSDTVVRQS KLSDNQADEG AGVFNGATAT LTLYSTKVVK NTAVTDGGGI
FNEAGGTVEL NTATGTTVAK NRPNNCVNVT GCPD