Gene Sare_4341 details

Gene Information

Locus tag: Sare_4341
Symbol:
ID: 5708409
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4908125
End bp: 4909804
Gene Length: 1680 bp
Protein Length: 559 aa
Translation table: 11
GC content: 70%
IMG OID: 641273763
Product: helicase domain-containing protein
Protein accession: YP_001539113
Protein GI: 159039860
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG1061] DNA or RNA helicases of superfamily II
TIGRFAM ID:
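
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive (which is consistent with the stated gene length) and that the final codon of an unspliced CDS is a stop codon.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp):
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(4908125, 4909804)
print(length, protein_length_aa(length))  # 1680 559
```

Both results agree with the Gene Length and Protein Length fields.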


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGTGGTG GACCCCTGAT CGTGCAGTCG GACAAGACCC TCCTCCTGGA GATCGACCAC 
CCCGACTCGC AGGCATGCCG GCTGGCGATC GCACCGTTCG CCGAGTTGGA ACGCTCCCCA
GAGCACGTGC ACACGTACCG GCTGACCCCC CTGGGGCTGT GGAACGCCCG GGCCGCCGGC
CACGACGCCG AGGGCGTGGT CAACGCGCTG ATCACCTACA GCCGCTATCC GGTTCCGCAC
GCCCTGCTGG TCGACGTGGC CGAGACGATG GACCGGTACG GCCGGCTCCA ACTGGTCAAC
GACCCGGCAC ACGGCCTGGT GCTGCGGGCC CTGGACCGGG TGGTGCTGGT CGAGGTCGCC
AAGAGCAAGA AGCTCGCCGG GATGCTCGGC ACGAAGCTCG ACGACGACAC GGTCACGGTG
CATCCGTCCG AGCGCGGACG GCTCAAGCAG GCGCTGCTCA AGCTCGGCTG GCCGGCGGAG
GACCTGGCCG GCTACGTCAA CGGTGAAGCC CACCCGATCG CGCTGGCCGA GGCCGGCAAG
GACGGCGGGA AGCCGTGGAC GCTGCGCTCG TACCAGCGGG AGGCGGTGGA GGCGTTCTGG
GCCGGCGGGT CGGGTGTGGT GGTGCTGCCC TGCGGCGCCG GCAAGACCCT GGTCGGGGCG
GCGGCGATGG CCGAGGCGAA GGCGACCACG CTGATCCTGG TGACGAACAC CGTCGCGGGC
CGGCAGTGGA AACGGGAGCT GGTCGCCCGC ACGTCGCTGA CCGAGGCGGA GATCGGCGAA
TACTCGGGCG AACGCAAGGA GATCCGCCCG GTGACCATCG CCACGTACCA GGTGTTGACG
TCACGGCGCG GCGGCGCGTT CACCCACCTG GACCTGTTCG GGGCGCGCGA CTGGGGTCTG
GTCGTCTACG ACGAGGTGCA CCTGCTGCCC GCGCCGATCT TCCGGTTCAC CGCCGACCTT
CAGGCCCGCC GCCGGCTGGG GCTGACCGCA ACCCTGGTCC GCGAGGACGG CCGGGAGGGG
GACGTGTTCA GCCTGATCGG CCCGAAGCGG TACGACGCAC CGTGGAAGGA CATCGAACAG
CAGGGCTGGA TCGCCCCGGC CGAATGCACC GAGGTACGGG TGACACTGAC CGATGCGGAG
CGCATGGCGT ACGCGACGGC GGAGGCCGAC GAGCGCTACC GGATGGCGGC GACCACGCGT
ACCAAGTTGC CGGTGGTGAA GGCGCTGCTC GACCGGCACC CGGGGGAGCA GACGCTGGTG
ATCGGCGGGT ACATCGATCA GCTGCACCAG TTGGGGGAGT ACTTGGACGC GCCGATCGTG
CAGGGGTCGA CCACGAACAG GGAGCGGGAG CGGCTGTTCG ACGCGTTCCG CTCGGGTGAG
CTGCAGACCC TGGTGATCTC GAAGGTGGGC AACTTCTCGA TCGATCTGCC GGAGGCGGCG
GTGGCGGTCC AGGTGTCGGG CACGTTCGGT TCCCGGCAGG AGGAGGCGCA GCGGCTCGGC
CGGGTGCTCC GGCCGAAGAT CGACGGCCGG CAGGCACACT TCTACACGGT GGTGTCTCGG
GACACGATCG ACACCGAGTA CGCCGCCCAC CGGCAACGCT TCCTCGCCGA GCAGGGGTAC
GCCTACACGA TCGTGGACGC CGACCACGTC CTTGGCCCGT CGCTGCCCTC GGTCGACTGA
 
Protein sequence
MSGGPLIVQS DKTLLLEIDH PDSQACRLAI APFAELERSP EHVHTYRLTP LGLWNARAAG 
HDAEGVVNAL ITYSRYPVPH ALLVDVAETM DRYGRLQLVN DPAHGLVLRA LDRVVLVEVA
KSKKLAGMLG TKLDDDTVTV HPSERGRLKQ ALLKLGWPAE DLAGYVNGEA HPIALAEAGK
DGGKPWTLRS YQREAVEAFW AGGSGVVVLP CGAGKTLVGA AAMAEAKATT LILVTNTVAG
RQWKRELVAR TSLTEAEIGE YSGERKEIRP VTIATYQVLT SRRGGAFTHL DLFGARDWGL
VVYDEVHLLP APIFRFTADL QARRRLGLTA TLVREDGREG DVFSLIGPKR YDAPWKDIEQ
QGWIAPAECT EVRVTLTDAE RMAYATAEAD ERYRMAATTR TKLPVVKALL DRHPGEQTLV
IGGYIDQLHQ LGEYLDAPIV QGSTTNRERE RLFDAFRSGE LQTLVISKVG NFSIDLPEAA
VAVQVSGTFG SRQEEAQRLG RVLRPKIDGR QAHFYTVVSR DTIDTEYAAH RQRFLAEQGY
AYTIVDADHV LGPSLPSVD
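
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a block-formatted sequence might be cleaned, its GC fraction computed, and the coding sequence translated. The helper names are hypothetical, not part of any database tool. The codon table is built from the standard NCBI amino-acid ordering, which translation table 11 shares with the standard code. The set of alternative start codons is the one listed for table 11, so a leading GTG (as in this gene) is read as M. Ambiguous bases such as N are not handled.

```python
from itertools import product

# Amino acids for codons in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}
# Initiator codons listed for translation table 11; any of them is read as M
# when it is the first codon of the CDS.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate_cds(seq):
    """Translate a CDS from its first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        codon = s[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon, whatever it normally encodes
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAGTGGTG GACCCCTGAT CGTGCAGTCG"))  # MSGGPLIVQS
```

The printed residues match the first ten residues of the protein sequence. Note that the internal GTG codon is translated as V, while the initial GTG is translated as M.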