Gene Sare_4439 details

Gene Information

Locus tag: Sare_4439
Symbol:
ID: 5705917
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5014733
End bp: 5017216
Gene Length: 2484 bp
Protein Length: 827 aa
Translation table: 11
GC content: 73%
IMG OID: 641273855
Product: hypothetical protein
Protein accession: YP_001539204
Protein GI: 159039951
COG category:
COG ID:
TIGRFAM ID:
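
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record; the variable names are illustrative, and it assumes that the start and end coordinates are inclusive and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5014733
end_bp = 5017216
gene_length_bp = 2484
protein_length_aa = 827

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue to the protein.
total_codons = span // 3
coding_codons = total_codons - 1

print("span matches gene length:", span == gene_length_bp)
print("span is a whole number of codons:", span % 3 == 0)
print("codons minus stop match protein length:", coding_codons == protein_length_aa)
```

Under these assumptions the three checks agree with the listed values: 2484 bp corresponds to 828 codons, of which 827 encode residues.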


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.041911
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0019697
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGACCGAAT GGGAGCCGGC CACCGAGGCG GAGACCGCGT TGCGTGACGC GCTGCGCGCC 
AACGACCAGC AACGATACTT CCGCATCCTG GCCCGTACGG ATCTGCTGCT ACCCGTGTCC
GCGCAGGCGC TCGCCGGTCA GACGCCGATG AACTGGGGCA CCTGGACCAC CAGCGGCCGA
ACCCACGTGC TGGCTTTCAC CTCCGTCGCC GCGCTGCGCG CCTGCCTCGG CGAGCACGCG
GGTGCGAACC GTCGAGTCGC GTACGGTGAG CTGGCCGACC ACTGGCCCAA CCACGAGTGG
TGGCTTGCGG TGAACCCGGG ACTGCCCATC GAGGGATACC TGCCGGCGTG GTATGTCGCC
CAGCTCTCCC GCGGCGATGT CCGCCTGCCC GGTCGCCCCA TGGGGACACG GGCCCGGCTG
GAGCGTACCG AGACCTACAC CCAGCCCCGT TCCTTCGGCG CATCCTCACG TCCGGAGGTC
GAGCCGGCCG TGGGGCCACC GGGGCAGGGC CGCGTTCCGC GGGCCGGCCA GACACCCGAG
GACACCGCGC CGCTGCGCCC GACCTCATCG GCGCCGCCCC CAGGCACACC GCTGCGGACC
CCAGCCGACC TGCGCGCGGC AGCTCTTCGC CGGCCGTTGA TGGATACCGG CCGCACCGCG
GCGACCGGTG GGGAGCCGGC CGCCCTCTGG CCCCGTCCCG GCCACCCGCC CGCCGACGGG
GGTGCGCCCG CGGACAGGCC TCCCCCGGAG GAGACCGAAC GGCCGATCCG CCGTTCCTTC
TTCGAGCCGG CGACCGTCCG CCCCAACCGG GACGAGCGGG GCGTCCCACC GTCGCGCCTC
GCCCGGGGCA GCCAGCAGTT CCCGCGTCGT CGGCTGGCCG ACGACGCGCC GCCGGCCGCC
GCGCCGATGT TCTCCGCCGG AGCCGAGACG CGGGCCGGTC AGGACGGTTC CGTCGCCCCA
CCGGGCGGCG GGACAGCACC CCCGGTGTCC CCGCAGCAGG CGACGGCAGA GGAGGCCCGG
ACCGTCCGCT TCCCGGTGCC GGACTCGGCG ACCCTCCGGC TGTACGCGCA TCGTGACTCG
GAGGAGGCAG CGTCCCGCCC CCGCCCCTCG CCCGTGGGTC CGGAGCCGGC CCCGAACACC
GAACCGGCTT CCGGCCCACC GCTACCCCGG CGAAGTCCTC CTCCGACCGT GATCGAGGGC
ACGATCATCG AGTCGCGGAA CCTGACCGCC GGATCGGACG CCGCATCGGC CACCATGCCG
ATTCCCGCGA CGGCGGGTGC GGTACCCGGC CCGATCGCCG CTGGGGCGGC ACCCCCGGAC
ACCGCCGCAC CACCCGGTGC CGAGCCGGGC GCCCTGCTGC CCGCCCAACC GTCGGTAGCG
GAAACTGCGG CGGAGCCCGG CGGAAGCGAC TCCGCGCATG CCACGGAACG GGTCGACTCC
GCGACGGCAG AGGACGTGCC CCGTCCAGGC CCGCCACCGG CACCCACATT GTCGGACTTC
GCGCCGGCCA ACGCGGTCGA GGAGGAACTG GTCACCGCGG CCACCAGCGG CAACCCCGAC
ACCTTGCTGT CGACGCTGCT GCTCGCCCAG GTGCTGCTGC CAATAGCGGC CGATTCGGCT
CCAGGTAGCC GGCCGGGTGA GGAGGGGTTC GTCTGGCGGA CCGAGGACCT GGATGGGGGG
TTGTCCGTCG TCGTCTACAC CTCCCCGGAG CGGCTCGCCG ACCACACCGC CGGGGGCATC
GAGACCATCC GGGTCCGCTT TGTGCAGCTC ATCCGCCGCT GGCCAGACCG ATCGTGGTCA
TTCGTCGTCA ATCCGGGAAC CCCGATCGGC ACGAAGCTGC CCGGCGAGCA GATCGTCGGT
CTCGCAACCT GGGCGGCCGA GGTCGGCTTG GGCGACGACC GCCCGGAACC AGAGCCGGTG
CCGGCCGGCA AGGCGGTACC GACCGACCCG CCGGCACCGA CTGCCCGGCC GCGCTACCGG
CCGCCGGGCC CCGAATCCAC GAGCCCCGTC GTGATGCAGA AGGCAGTGGC CGCCAGCCAA
CTCGCCTACT ACCTGGAACG GGGCTACGAC CGGGTTTCCG GCTTCGTGCA CCGCGCGGGC
GAGCTGGCGC ACCTGACCAC ACCGGCCCAG CTGTACGACG CGCTCGGGCT GGGCTACCCC
GGCTCCCCGT TCGACCGGGG GGCCGAGCGG ATCTACGTGT TGCGTTGGCC GGCGTTCCGG
CCGAGCCTCT ACCGCATCCC GTACGGTGGG CAGACCGAAG CCGCGATGCG GGCGATGGAA
GGCTGGGTGA TCGAACGGCC GCCGTTCCGC GGCAATGGCT TTGCTCCGGG CGAGAGCAGT
GACGTGGTGG CGGAGTTCAA AGTCGACAGT GCGCGGCTGC CACACGGCGC GCAGCTGTGG
CGCATCGGCG CGGACGGCAC CGAGCGGATG GTCGCCACGT TGGATGCTGA TGAGACGGTG
TGGCGGCAGG TGGGTGACGG GTGA
 
Protein sequence
MTEWEPATEA ETALRDALRA NDQQRYFRIL ARTDLLLPVS AQALAGQTPM NWGTWTTSGR 
THVLAFTSVA ALRACLGEHA GANRRVAYGE LADHWPNHEW WLAVNPGLPI EGYLPAWYVA
QLSRGDVRLP GRPMGTRARL ERTETYTQPR SFGASSRPEV EPAVGPPGQG RVPRAGQTPE
DTAPLRPTSS APPPGTPLRT PADLRAAALR RPLMDTGRTA ATGGEPAALW PRPGHPPADG
GAPADRPPPE ETERPIRRSF FEPATVRPNR DERGVPPSRL ARGSQQFPRR RLADDAPPAA
APMFSAGAET RAGQDGSVAP PGGGTAPPVS PQQATAEEAR TVRFPVPDSA TLRLYAHRDS
EEAASRPRPS PVGPEPAPNT EPASGPPLPR RSPPPTVIEG TIIESRNLTA GSDAASATMP
IPATAGAVPG PIAAGAAPPD TAAPPGAEPG ALLPAQPSVA ETAAEPGGSD SAHATERVDS
ATAEDVPRPG PPPAPTLSDF APANAVEEEL VTAATSGNPD TLLSTLLLAQ VLLPIAADSA
PGSRPGEEGF VWRTEDLDGG LSVVVYTSPE RLADHTAGGI ETIRVRFVQL IRRWPDRSWS
FVVNPGTPIG TKLPGEQIVG LATWAAEVGL GDDRPEPEPV PAGKAVPTDP PAPTARPRYR
PPGPESTSPV VMQKAVAASQ LAYYLERGYD RVSGFVHRAG ELAHLTTPAQ LYDALGLGYP
GSPFDRGAER IYVLRWPAFR PSLYRIPYGG QTEAAMRAME GWVIERPPFR GNGFAPGESS
DVVAEFKVDS ARLPHGAQLW RIGADGTERM VATLDADETV WRQVGDG
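
The sequences above are printed in space-separated blocks of ten, so they need cleaning before any computation. The following is a minimal sketch of how the GC content and the terminal codons might be checked; the helper names `clean` and `gc_fraction` are hypothetical and not part of any database tooling, and only the first and last rows of the gene sequence are pasted in for illustration.

```python
def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


# First and last rows of the gene sequence, copied from the record.
first_row = "GTGACCGAAT GGGAGCCGGC CACCGAGGCG GAGACCGCGT TGCGTGACGC GCTGCGCGCC"
last_row = "TGGCGGCAGG TGGGTGACGG GTGA"

print("bases in first row:", len(clean(first_row)))
print("first codon:", clean(first_row)[:3])
print("last codon:", clean(last_row)[-3:])
print("GC fraction of first row:", round(gc_fraction(first_row), 3))
```

To compare against the listed GC content, the full gene sequence would be passed to `gc_fraction` instead of a single row. The first codon, GTG, is an alternative start codon in translation table 11, which is consistent with the protein sequence beginning with M.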