Gene Sare_4776 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4776 
Symbol 
ID5704443 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5405405 
End bp5406874 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content70% 
IMG OID641274174 
ProductMlrC domain-containing protein 
Protein accessionYP_001539520 
Protein GI159040267 
COG category[S] Function unknown 
COG ID[COG5476] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.76631 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00013029 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCCTCGCC CCCTCCGCGT CGCCATCGGC GGCATCCACA TCGAATCAAG CACATTCTCA 
CCCCACCTGA GCACCGCCGA CGACTTCGAG GTCACCCGGG AGGAGGCGTT GCTCGCGCGA
TACGCCTGGC TCTCGCCCAC CCAGCCGTGG GCCGCTGACG TCGAATGGAT ACCACTCGTG
CACGCCCGGG CCCTTCCCGG CGGTGCCGTG GACGCCGAGA CGTACGCGCA ATGGGCCGCG
GAGATCGTGG AGAAGCTCGC CGCAGCCGGC CCACTGGACG GCGTCCTGCT GGACATTCAC
GGTGCGATGA GCGTGGTCGG CCGGTCCGAC GCAGAGGGCG ACCTGGTCAC CGCCATCCGG
TCGGTGATCG GTCCCGAGCC GTACATCTCG GCGGCGATGG ATCTGCACGG CAACGTCTCA
CCGGTGCTCT TCGACGCCTG CGACCTGCTC ACCTGCTACC GCACCGCCCC ACACGTCGAC
GTGTGGGAAA CCCGGGAGCG TGCCGCACGC AACCTCATCG AGGCGTTACG CCGTGAGCGA
CGTCCGCACA AGGCGCTGGT CCACGTGCCG ATCCTGTTGC CCGGCGAGAT GACCAGCACC
CGCGAGGAAC CGGCCCGGGG ACTCTACGCA CGGATCCCCG AGATCGAGGG CCGTGCCGGT
GTGGTCGACG CCGCGATCTG GATCGGGTTC GCCTGGGCCG ACCAGCCACG CTGTCAGGGC
GCGGTCGTCG TCACCGGCAC CGACGCCACC GTCACCGCCG AGGCGGCCCG CGAACTCGGC
GGGCATTTCT GGGCGGCCCG GGACGAGTTC ACCTTCGTCG CTCCCACCGG CTCGATGGAC
GAGTGCCTCG ACACCGCCCT GGCCGCCGTC GCCGATCAGG CGAAGCGGCC GTTCTTCATC
AGCGACTCCG GCGACAACCC GGGCGCCGGC GGCGCCGACG ACGTCACCTT CGCCCTCGAC
CGGATGTTGG CCCGCCCCGA GATCCGCGAC GGCTCGCGTC GGGCCGTGTT CGCCTCGCTG
GTCGACCCGC AGGCGGTAGC GCTTGTCGCC GATCAGCCGG TCGGCTCTCC GGTCCGGGTG
TCGGTCGGCG GGCGGATCGA CTCCCGGCCC CCGGGCCCGG TGGAACTCGA CGGGATTTTG
GAAGCGGTCG CCGACGATCC CGACGGCGGG CGATGCGTCA GCGTACGGGT CGGCGGGCTC
AGCATCTTCG TCACGTCCCG CCGGATGCAG TACCGGCTAC TGGCGTCGTA TACCCGGCTG
GGGGTCAGGG TCGACGAGGT GGACGTGGTC GTGGTGAAGA TCGGCTATCT CGAGCCGGAA
CTGTTCGACG CCGCCGGAGA CTGGCTGCTC GCCCTGACTC CGGGTGGCGT CGACCAGGAC
CTTGCGCGGT TGCCGTACCA GAACGTGGTG CGACCGGTGT TTCCCCTGGA CCGTGACTTC
GCGGCGGACC TCGCCGTGGT TGTCGGATGA
 
Protein sequence
MPRPLRVAIG GIHIESSTFS PHLSTADDFE VTREEALLAR YAWLSPTQPW AADVEWIPLV 
HARALPGGAV DAETYAQWAA EIVEKLAAAG PLDGVLLDIH GAMSVVGRSD AEGDLVTAIR
SVIGPEPYIS AAMDLHGNVS PVLFDACDLL TCYRTAPHVD VWETRERAAR NLIEALRRER
RPHKALVHVP ILLPGEMTST REEPARGLYA RIPEIEGRAG VVDAAIWIGF AWADQPRCQG
AVVVTGTDAT VTAEAARELG GHFWAARDEF TFVAPTGSMD ECLDTALAAV ADQAKRPFFI
SDSGDNPGAG GADDVTFALD RMLARPEIRD GSRRAVFASL VDPQAVALVA DQPVGSPVRV
SVGGRIDSRP PGPVELDGIL EAVADDPDGG RCVSVRVGGL SIFVTSRRMQ YRLLASYTRL
GVRVDEVDVV VVKIGYLEPE LFDAAGDWLL ALTPGGVDQD LARLPYQNVV RPVFPLDRDF
AADLAVVVG