Gene Sare_4028 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4028 
Symbol 
ID5706432 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4579822 
End bp4581888 
Gene Length2067 bp 
Protein Length688 aa 
Translation table11 
GC content70% 
IMG OID641273453 
Producttranscription termination factor Rho 
Protein accessionYP_001538809 
Protein GI159039556 
COG category[K] Transcription 
COG ID[COG1158] Transcription termination factor 
TIGRFAM ID[TIGR00767] transcription termination factor Rho 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0016816 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGAGCGACA CCACCGACGT GACGTCGGAT GTTTCCAACG TCGCCGGCGA TGCCACGACC 
GCTGCTCCCA CCCGTCGTCG GCGTAGCGGC ACGGGTCTGT CGGCGATGCT GCTGCCTGAG
CTTCAGAGCC TGGCTGCGTC GCTCGGCATC TCGGGCACGG CTCGCATGCG TAAGGGTGAG
CTGATCAGCG CGATCACCGA GCGCCAGGGT GGCGGGGCAG CCACCGGAAC CCCTCGACCG
CGGGCCGAGG TCGCGGCTGC TGCCGCGCTC GCCCGCGAGG AGGTCCACGC GGAGGTCCGC
GAGTCGGGCG AGCGGCCGGA GGCCGAGTCA CGTCCTGCGG AGCAGCCAGC CGCTGGTGGG
ACCACCGGCC GGGCTCGGGG CCGGCGTGCC CGTGCGGCCA GCGAGGCCAG TGAGGCCCGT
GTCGAGGCGC GACCAGAGGG CGCCGAGGCC GGCGACCGGG CCGAGCGTGG CGATCGGGCC
GAGCGTGGTG ACCGGGCCGA GCGTGGTGAC CGGGCCGAGC GTGGCGATCG GGCCGAGCGT
GGCGATCGGG CCGAGCGTGG TGACCGGGCC GAGCGTGGCG ATCGGGCCGA GCGTGGCGAT
CGGGCCGAGC GTGGCGATCG GGCCGAGCGT GGCGATCGGG CCGAGCGTGG TGACCGGGCC
GAGCGTGGCG ATCGGGCCGA GCGTGGTGAC CGGGCCGAGC GTGGTGACCG GGCCGAGCGT
GGTGACCGGG CCGAGCGTGG TGACCGGGCC GAGCGTGGCG ATCGGGCCGA GCGTGGTGAC
CGGAACGACC GCGGCCAGCG TGACAATGAC GGCGACGAGG AGAACGAGGG TGGCGGCCGG
CGCGGCCGGC GCAGCCGATT CCGGGACCGT CGTCGTGGCC GTGGCGACCG GGTCGACGGC
GGCGACGGTG GCCGGGAGCC CCAGGTCAGC GAGGACGACG TGCTCGTTCC GGTGGCAGGC
ATCATCGACG TGCTCGACAA CTACGCCTTC GTCCGGACCA CCGGCTACCT GGCCGGTCCG
AACGACGTCT ACGTCTCGAT GTCCCAGATC AAGCGGTACG GCCTGCGGCG CGGTGACGCG
ATCACCGGTG CCGTGCGCGC GGCGCGGGAG GGCGAGCAGC GGCGGGACAA GTACAACCCG
CTCGTTCGGC TGGACACCAT CAACGGGATG GAGCCCGAGG AGGCGAAGCG GCGGCCGGAG
TTCTATCGAC TCACCCCGCT CTACCCGCAG GAGCGGCTGC GGTTGGAGAG CGAGCCGCAC
ATCCTCACCA CCCGGGTGAT CGATCTGGTG ATGCCGATCG GCAAGGGTCA GCGGGCGCTC
ATCGTTTCGC CGCCGAAGGC CGGTAAGACC ATGGTGTTGC AGGCGATCGC GAACGCGATC
ACCCACAACA ACCCGGAGTG CCACCTGATG GTGGTGTTGG TGGACGAGCG CCCAGAAGAG
GTCACCGACA TGCAACGGTC GGTGAAGGGC GAGGTCATCG CGGCGACGTT CGATCGCCCG
CCGCAGGATC ACACCACCGT CGCCGAACTG GCGATCGAGC GGGCGAAGCG GCTGGTCGAG
CTGGGCCACG ACGTGGTCGT GCTGCTCGAC TCGGTGACGC GGCTCGGGCG GTCGTACAAC
CTGGCGGCAC CGGCCAGTGG TCGGATCATG TCGGGTGGTA TCGACTCCAC CGCGCTGTAT
CCACCCAAGC GGTTCCTGGG CGCGGCCCGC AATATCGAAA ACGGCGGCTC GTTGACCATC
CTCGCCACCG CGCTGGTGGA GACCGGGTCG ATGGCGGACA CGGTCATCTT CGAGGAGTTC
AAGGGCACCG GCAACGCGGA GCTGAAGTTG GACCGGAAGA TCGCCGACAA GCGAACCTTC
CCGGCTATCG ACATCCACCC GTCCGGCACG CGTAAGGAGG AGATCCTGCT CGCGCCGGAG
GAACTGGCCA TCGTGCACAA GCTTCGGAAG GTGCTGCACT CGCTGGATTC GCAGGCCGCG
CTGGACCTGT TGCTGGACCG CCTCAAGCAG TCGCGCACCA ACATCGAGTT CCTGATGCAG
ATCGCGAAGT CGACACCGGG GGAGTGA
 
Protein sequence
MSDTTDVTSD VSNVAGDATT AAPTRRRRSG TGLSAMLLPE LQSLAASLGI SGTARMRKGE 
LISAITERQG GGAATGTPRP RAEVAAAAAL AREEVHAEVR ESGERPEAES RPAEQPAAGG
TTGRARGRRA RAASEASEAR VEARPEGAEA GDRAERGDRA ERGDRAERGD RAERGDRAER
GDRAERGDRA ERGDRAERGD RAERGDRAER GDRAERGDRA ERGDRAERGD RAERGDRAER
GDRAERGDRA ERGDRAERGD RNDRGQRDND GDEENEGGGR RGRRSRFRDR RRGRGDRVDG
GDGGREPQVS EDDVLVPVAG IIDVLDNYAF VRTTGYLAGP NDVYVSMSQI KRYGLRRGDA
ITGAVRAARE GEQRRDKYNP LVRLDTINGM EPEEAKRRPE FYRLTPLYPQ ERLRLESEPH
ILTTRVIDLV MPIGKGQRAL IVSPPKAGKT MVLQAIANAI THNNPECHLM VVLVDERPEE
VTDMQRSVKG EVIAATFDRP PQDHTTVAEL AIERAKRLVE LGHDVVVLLD SVTRLGRSYN
LAAPASGRIM SGGIDSTALY PPKRFLGAAR NIENGGSLTI LATALVETGS MADTVIFEEF
KGTGNAELKL DRKIADKRTF PAIDIHPSGT RKEEILLAPE ELAIVHKLRK VLHSLDSQAA
LDLLLDRLKQ SRTNIEFLMQ IAKSTPGE