Gene Sare_1823 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1823 
Symbol 
ID5706469 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2100976 
End bp2102916 
Gene Length1941 bp 
Protein Length646 aa 
Translation table11 
GC content68% 
IMG OID641271325 
Producthypothetical protein 
Protein accessionYP_001536700 
Protein GI159037447 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000592634 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGTGCGG TCTTGGTGTG TGCGGTGACC GTCGCCGCCG CGGTAGCCGT ACCGGCCGCG 
GCGTCCGCGA CCGTGCCACC AGCGCCCGAG ACACAGACCG TCGTGTCGGC GGACCCGGCC
GACCTGACGC CGCACGCCCG CGACGGCGAG GTGCGCGCGT TCGCGCAGGT CGGTTCGATG
GTGTACGTCG GGGGCAGCTT CACCCAGATC CGACAGGACC GGGACGCGGC CTGGAGCACC
CAGCGTTACC TGTTCGGCTA TGACAAGACC ACCGGGCTGA TCTCGACGAC CTTCCTGCCG
GTGCTCGATG GCGCGGTGAA CTCGCTGGTC GCCGGTCCGA ACGAGACGTT GATCGTCGGC
GGGGCGTTCC GCAACGTGAA CGGTGTGTCC CGAAAGAACC TGGTGGCACT CGACCCGAGC
ACCGGTGCCA CCATCGACGC CTGGGAGGGT CGCTCCGACG GAGGCACCGT GCGGGACATG
GCGCTGCACG GCAACTGGCT CTACCTGGGC GGCGCCTTCA ACTGGATGAA CGGTGCCTCG
CACAAGGGGC TCGCCCGACT ACACGCCAGT ACGGGGGCGA TCGACCCGAC CTTTGCCATC
GACGCCACCG ATGGACGAGC CAACACCAGC TCGTATGTCT GGACCATCGA TGTGTCGCCG
GACGGGGACG AGCTGGTAGC TGGCGGCAAC TTTACGGACG TCAACGGCCT GTCGCGGAGC
CAGATTGCAC TGATCGACCT CACCGGTACG CCGAGTGTCA TCGACTGGAG CACGGAAAAG
TTCGTCTCGC CCTGCGGCGG AGCCCCGGCC TTCTCCTACC TGCAGGATGT GAGCTTCAGT
GAAGACGGCA GCTGGTTCGT CGTGGGTACC AACGGTGGCT CGGGCTGGCC TGCCGCGTAC
TGTGACGCGC TGGTGCGCTT CGAGACCGCG GCCCGCGGCG ACGGTCAGGT CGGCACCTGG
GTGAACTACA CCGGCAGGGA CACCATCACC TCGGTCGAGG TGGCTGACAA CACCATCTAC
CTTGGCGGGC ACTTTCGCTG GCTGAACAAC CCCAACACGC AAGACAAGGC TGGCGACGGT
GCCATTGACC GGCTCGGTGT CGCCGCCGTG TCTCCAGCCA CCGGAATGCC GGTGAACTGG
AACCCGCGTC GCAGTGGCGG CTCGTCGATG CCGCCGGGCA CCAGCGAATG GGGCTCCACC
GTGCCGGTGC TCTGGCGGGG CGACGACGGG CTGTACTTCG GGCACAACTC CGACGGAATG
GGTAGCGAGT ACCACGGCCG GCTCGGCATG TTCCCGCTGT CCGGCGGCCG GACGATCACC
CCGAAGAACC CGTCGACCGC GACCAGCGGC AATCTCTACC TGGGCACCGG GGAGGGCGAA
CTGGCACGGG TCCCGTTCGA CGGCGCCGGT ATCGGTTCGC CCACCACGGT CAGCCAGCCG
AACTACACCG CCGCCGGAGC GACGTGGGGG ATGAGCGACC GGATCTACTG GGTGCACACC
GTCGCCGGCA CCCCGACGGG CAGCCGGATC GACATCTCGA TGTTCAATGG GGGCGCGGTC
GGTGTCCCGT GGGAGACCTC CGGCTACAAC AACTGGTTCG ACGCGGCGGA CATGGCTGGC
GCCTTCTTCC TTGACGGGCG CCTCTACTAC GCCCGGGCCG GCTCCGACGC CCTCCACTAC
CGGTACTTCG AGACCGATGG GAACTACCTC GGCGCCACCG AGTTCACGCT GCCCACCACC
GGGATCACCT GGTCGGAGGT GCGGGGCATG GCCTGGGTGG GCGGCAGCGT CGTGTACGGG
AACATTGACG GAGGCCTACG CAGCGTCCCC TTCGACCCGG CGGGCGACCC GGCGGTCAAC
GGGGCGGAAA GCACTCTTCT CGCCGCGGCG ACCGCAGAGC TGACCTGGTC CACTCCGCGG
ATGTTCTTCG CCGTACAGTA G
 
Protein sequence
MRAVLVCAVT VAAAVAVPAA ASATVPPAPE TQTVVSADPA DLTPHARDGE VRAFAQVGSM 
VYVGGSFTQI RQDRDAAWST QRYLFGYDKT TGLISTTFLP VLDGAVNSLV AGPNETLIVG
GAFRNVNGVS RKNLVALDPS TGATIDAWEG RSDGGTVRDM ALHGNWLYLG GAFNWMNGAS
HKGLARLHAS TGAIDPTFAI DATDGRANTS SYVWTIDVSP DGDELVAGGN FTDVNGLSRS
QIALIDLTGT PSVIDWSTEK FVSPCGGAPA FSYLQDVSFS EDGSWFVVGT NGGSGWPAAY
CDALVRFETA ARGDGQVGTW VNYTGRDTIT SVEVADNTIY LGGHFRWLNN PNTQDKAGDG
AIDRLGVAAV SPATGMPVNW NPRRSGGSSM PPGTSEWGST VPVLWRGDDG LYFGHNSDGM
GSEYHGRLGM FPLSGGRTIT PKNPSTATSG NLYLGTGEGE LARVPFDGAG IGSPTTVSQP
NYTAAGATWG MSDRIYWVHT VAGTPTGSRI DISMFNGGAV GVPWETSGYN NWFDAADMAG
AFFLDGRLYY ARAGSDALHY RYFETDGNYL GATEFTLPTT GITWSEVRGM AWVGGSVVYG
NIDGGLRSVP FDPAGDPAVN GAESTLLAAA TAELTWSTPR MFFAVQ