Gene Sare_4273 details


Gene Information

Locus tag: Sare_4273
Symbol:
ID: 5705778
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4848661
End bp: 4850334
Gene Length: 1674 bp
Protein Length: 557 aa
Translation table: 11
GC content: 73%
IMG OID: 641273692
Product: hypothetical protein
Protein accession: YP_001539045
Protein GI: 159039792
COG category:
COG ID:
TIGRFAM ID:
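
The start, end, gene length and protein length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4848661
end_bp = 4850334
gene_length_bp = 1674
protein_length_aa = 557


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)
print(computed_length, computed_protein)  # prints: 1674 557
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.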


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.4813
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.012462
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCTTTC GGACCTGGGG CAAACTGCTA CTCACGGCGC TCGGAGCGAG CCTGCTGGCC 
GGAGCCATTC AGCTCGGCAT CGCATTTGGC TTCGGCATCG TTCGACTCAC CGGGGCCTTC
ACCGGCGACA GTGTCAACCA GTGGCCGGCG CAGCTCACCT GGGTCGGTTG GATAGCGGCG
AATGCCGCCG TCGCCGGTGC CATCGGCGCC GAACGGTTGG CTCGCCGGGA CGGCCTGCTG
ACCGGTATCG GTAGGCAGGC GGCCGTAGCG GGTGCCGCGA CGCTCGGTGC CATCGTCGTC
GCGCCGCTCT GCATGCAGCC CGCGCGGGCC GCGGACCTGG TCACTGCGGA ACCGGTCTGG
GCCGTCGCCA CCTGCGCCGT CCTCGGCGCG GTGGCCGGCG GTGGTGCGGC GCTGGCCGTC
CTGGCACGGC CCCCGCTGGG CTGGAGCGTG GCGCTCGTCG CCGGCGTGGT GTGGCTTCTC
GCTCTGATCT CGGTTGCCCC GTCGCTGGGG GAGACCGGGC CACTGCGGAC CGTACGCCTC
GGCATGTGGG AACCGTCCTG GCTGAGCGCC ACCGCGGCGC ACCGCCTGTC TCTGCTGTGT
CTGCCGGTGG TGGCACTGCT GGTTGGAGCG TCCACGGGCG GCCTGGCCCG TCGGCGCGAG
CTCCCGCCAC TGGTCGGTGG CCTGACCGGG GTAGCCGGAC CGGTGCTGCT GGCCTTCACC
TACCTGGCCG CGGGTTCCGG CGACGAGGTG GATCGGTACC AGGCCGCGCC CTACTACGGC
GCTCTGCTCG CGGTGGCCAC CGGTGCGCTT GGCTCGGCCG CCACCGTCGT GCTGCGCTGG
CCACTCGTGG TGCACCCGGC TGACCGGTCC GTTGCGACAC CCGGCACAGA GGCCGCCGGC
GCCATGGTGG ACGACACCTC ACCGAGCACC AACCCGGCAT CGAACCCCGA GACCACGCTG
AGCCCTGAGC CACCACCGAA CACCGAACCA CCACCGAACA CCGAACCAAC CCTGAGTCCC
GGGCCCGCTC CGACCCCCAA GCCGGCACCG AGCCCTCGCC GCGCGGGGCG ATCCGAGGTC
ACCTCCCGAC CCACTCCAGT CACCCCAACA CCGGTCACAT CGCCCCGGCC CGTCCCGACC
CCCGTCGAGT CGACCACACG TACCGCCGTT GACCCCGTAC CCGCAACACC ACCCCGGCCC
GTCCCGACCC CCGTCGAGTC GACCACACGT ACCGCCGTTC AGCCCGTACC GGTAACCCCG
CCCCGGCCGA CACCGACCCG GATCGGACCC GAGCCGTTCC CGATGCTGTC GCCGACACCG
CCGACACCGC CCGTGACCCG CACTTTCCCG GCAGACCCGC CCACCTCCAC CGCCGCACCT
GTCGTGTCCG GGGCAGCCCC GCCAACGGAC GACGGGTCCG ACCCGGACGC CCAAGGTGAT
CGGTCGGCGC CCGACGAAAC TCCGCCTGTA GCCCGCCGCC GGGGCCTGTT CCGGCGCCAC
CGCTCCCGCC CCAACGACGC GGGCGACCCG ACCGGGCCGG TACAGCTACC AGCGCAGGAC
GAGGAGTTTG TCGACTGGGT CACCGGCCTG AGCAAGCCTG CTCCGGACAA CGAAGCCGAC
CCGGAACGCG TCCGACGCTC GTTGCGCTCC GTCGGCCGAC ACCACGCCGA CTGA
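
The gene sequence is printed in blocks of ten bases, so the spacing has to be removed before any calculation. The sketch below shows one way to do that and to compute a GC fraction. The helper names are hypothetical, and only the first printed line is used as input; to compare against the GC content field, the full sequence would need to be pasted in.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks of ten."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as printed above (illustrative input only).
first_line = "ATGGCCTTTC GGACCTGGGG CAAACTGCTA CTCACGGCGC TCGGAGCGAG CCTGCTGGCC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 3))
```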
 
Protein sequence
MAFRTWGKLL LTALGASLLA GAIQLGIAFG FGIVRLTGAF TGDSVNQWPA QLTWVGWIAA 
NAAVAGAIGA ERLARRDGLL TGIGRQAAVA GAATLGAIVV APLCMQPARA ADLVTAEPVW
AVATCAVLGA VAGGGAALAV LARPPLGWSV ALVAGVVWLL ALISVAPSLG ETGPLRTVRL
GMWEPSWLSA TAAHRLSLLC LPVVALLVGA STGGLARRRE LPPLVGGLTG VAGPVLLAFT
YLAAGSGDEV DRYQAAPYYG ALLAVATGAL GSAATVVLRW PLVVHPADRS VATPGTEAAG
AMVDDTSPST NPASNPETTL SPEPPPNTEP PPNTEPTLSP GPAPTPKPAP SPRRAGRSEV
TSRPTPVTPT PVTSPRPVPT PVESTTRTAV DPVPATPPRP VPTPVESTTR TAVQPVPVTP
PRPTPTRIGP EPFPMLSPTP PTPPVTRTFP ADPPTSTAAP VVSGAAPPTD DGSDPDAQGD
RSAPDETPPV ARRRGLFRRH RSRPNDAGDP TGPVQLPAQD EEFVDWVTGL SKPAPDNEAD
PERVRRSLRS VGRHHAD
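
The protein sequence can be related back to the gene sequence by translating codon by codon. The sketch below builds a codon table from the standard compact amino-acid string. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which start codons it permits, which does not matter here because this gene begins with ATG. The function name and the short test fragments are illustrative, not part of the record.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a full codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence as printed above.
print(translate("ATGGCCTTTC GGACCTGGGG CAAACTGCTA"))  # prints: MAFRTWGKLL
```

The output matches the first ten residues of the protein sequence, and translating the final bases of the gene (CACCACGCCGACTGA) gives HHAD, which matches its last four residues.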