Gene Sare_4398 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4398 
Symbol 
ID5703447 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4969987 
End bp4971813 
Gene Length1827 bp 
Protein Length608 aa 
Translation table11 
GC content70% 
IMG OID641273817 
Producthypothetical protein 
Protein accessionYP_001539166 
Protein GI159039913 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.5486 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0412393 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGACG ACCATCCCGC CGGACCCTCG GGCCGGCACC TCGGCCGGCA CGGGCCGGAC 
GGCGGGGGCC CGGGCGAGGC GGTGCCCGGC TCCGGCCCGG CGCCCACGTC ATCGCCGAAC
GGCCCCCATG CCAGTGAGGC GGCCGGCGGC CGGCCGGCGT GCCGGCCCCG CAGCACCGAC
CCGGCCGAGT TGGGCTTCAC CCCACGCAAA CCGGTCCCGT GGCTGGCGCC GTTCCTGCTG
GTCAGCACCG GCATCCGTAC GCTGCTCGCG CTGCTCTTCG GCGCCTACCT GGACAAACGA
GAGCTACAGA CCGCCTTCGA CGCGAAGATC AGCCGGCAGG TTGGGCCAGA CGGTGGTGTC
TGGCTGGACT ACGTCGCCGA CCTTGGTGAC GGCTTCGACG CCACCTACTC GGTCGCGTAC
CTGCTGGCCC AGCGGGAGCT GATGGTCGAA GGACACCGGC TGCCCCGGGC GCAGGTGCTG
GTGATGGGCG GCGACCAGGT CTATCCGTCG GCGGCCTTCG ACACATACGA GGACCGGTGC
AAGGGCCCCT ACCAGGCAGC GCTGCCAGTC ACTCCACCCG AGCAGCCGAC GTTGTTCGCG
ATCCCCGGCA ACCACGACTG GTACGACGGT CTCACCGCCT TCCTGCGGCT CTTCGTCCGG
TCCCGGGACC GGCACTTCGG CGGCTGGAAC ACCGAGCAGT CCCGGTCGTA CTTCGCGGTG
GAACTGCCGG CGGACTGGTG GCTGTTCGGC CTGGACGACC AGTCGGGTTC GTACCTGGAT
GACCCACAGC TCACCTACTT CGACGACGTG GCCGAGCGGC TGGGGCCACA GAGTCGGGTG
ATCCTGGCGG TGCCGATGCC GACCTGGGTC AAGGCCACCA AACACCCGAC GGCGTACGAC
TCGATCGACT ACTTCATCCG CACCATCGTC GCGCCGACAG GGGCGCAGGT GCGGCTCCTC
ATCTCCGGTG ACCTGCACCA CTATGCCCGG TACGCGGGGC CGGACCGTCA GCTGATCACC
TGTGGTGGCG GCGGTGCGTA CCTCTACCCG ACGCACCTGT TGCCGGAGCG GATCCAGGTC
CCACCGAAGG AGACGCTGGC CCGGCGGGCG AGTGCCACGC AGGTGTACGA GTTGGCGGGG
CGATACCCCG ACGTGGCGCG GTCCCGGCGG TACGCCTGGG GCGCCTTTCT GCGGCTGCCG
TTACGTAACC CGGGTTTCAC CACGCTGCTC GGTGCCCTGT ACGCGCTGCT GGTCCTGGCG
ATGGTCGGGG TCTGCACGAA CCGCGATGAC GCCCAGCTGC GACTGTTCAG CGTTCCGTTG
GCGGCGATGC TGCTGGTGAC CCTGCTCGGG GCGTTCTTCT TCGCCAAGCC GCCCGGTTCC
GCGGGCAAGC GACGCCTTCG GCACTGGCTC CTCGGCGTGG GGCACGGTCT GGCGCACGTG
GCGTTGGCGG CAGGCGGCAC GTGGGTGTGG CTGGCACTGC CGTTCCACGA CTGGCCGTGG
CCGCTGTCGG TGGTCGCCGC GGTCGTGTTC TTCGGGTCGG TGGGCGGCCT GGCAGCAAGC
CAGCTGGTGG CGGCGTACCT GCTGGTGGCC GGCGCGTTCG GGGTCAACGT CAACGAACTC
TTCGCCGGTC AGGGCATTGA GGACGCGAAG GGTTTCCTGC GTATGCACAT CGCCCCGGAG
GGGACGCTGA CGATCTACCC GATCGGGCTC GACCGGGTGG GTCGCCACTG GCAGGTCAAC
CCCGACCTCT CCGCCGAGTC GTCGTGGCTG GTCCCGGGCA TCCCGCTGGA GCCTCGCCTG
GCCGAGCCCC CGCTGGTCCT CCGCTGA
 
Protein sequence
MTDDHPAGPS GRHLGRHGPD GGGPGEAVPG SGPAPTSSPN GPHASEAAGG RPACRPRSTD 
PAELGFTPRK PVPWLAPFLL VSTGIRTLLA LLFGAYLDKR ELQTAFDAKI SRQVGPDGGV
WLDYVADLGD GFDATYSVAY LLAQRELMVE GHRLPRAQVL VMGGDQVYPS AAFDTYEDRC
KGPYQAALPV TPPEQPTLFA IPGNHDWYDG LTAFLRLFVR SRDRHFGGWN TEQSRSYFAV
ELPADWWLFG LDDQSGSYLD DPQLTYFDDV AERLGPQSRV ILAVPMPTWV KATKHPTAYD
SIDYFIRTIV APTGAQVRLL ISGDLHHYAR YAGPDRQLIT CGGGGAYLYP THLLPERIQV
PPKETLARRA SATQVYELAG RYPDVARSRR YAWGAFLRLP LRNPGFTTLL GALYALLVLA
MVGVCTNRDD AQLRLFSVPL AAMLLVTLLG AFFFAKPPGS AGKRRLRHWL LGVGHGLAHV
ALAAGGTWVW LALPFHDWPW PLSVVAAVVF FGSVGGLAAS QLVAAYLLVA GAFGVNVNEL
FAGQGIEDAK GFLRMHIAPE GTLTIYPIGL DRVGRHWQVN PDLSAESSWL VPGIPLEPRL
AEPPLVLR