Gene Gobs_4277 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_4277 
Symbol 
ID8755971 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp4493115 
End bp4496342 
Gene Length3228 bp 
Protein Length1075 aa 
Translation table11 
GC content72% 
IMG OID 
Productpreprotein translocase, SecA subunit 
Protein accessionYP_003411210 
Protein GI284992656 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.259413 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTTCTCGA AGATCCTCCG AGCCGGTGAG GGCAAGATCC TCCGACGACT GAACAAGATC 
GCCGACGCGG TCGAGTCGCT GGCGGAGGAA ACTGCCGACC TCACCGACCC GGAGCTGCGC
GCCAGGACCG ACGAGTTCAA GGAGCGCCTG GCGGAGGGGG AGACCCTCGA CCAGCTCCTG
CCGGAGGCCT TCGCCGTCGT CCGCGAGGCC GCTACCCGCA CCCTCGGTCA GCGGCACTTC
CGCGTGCAGG TCATGGGCGG CGCCGCGCTG CACCTGGGCA ACATCGCCGA GATGCGCACC
GGTGAGGGCA AGACGCTGAC CGGCGTCCTG CCCGCCTACC TCAACGCGCT CACCGACCAG
GGCGTGCACG TGGTCACGGT GAACGACTAC CTGGCCAAGC GCGACGCGGA GTGGATGGGC
CGGGTGCACC GCTTCCTCGG CCTCTCCGTC GGCGTGATCC TCTCCGGCGA GCGCCCGGCG
CAGCGCCGCG AGCAGTACGC CTGCGACATC ACCTACGGCA CGAACAACGA GTTCGGCTTC
GACTACCTGC GCGACAACAT GGCGTGGAAC AAGTCCGACC TCGTGCAGCG CGGGCACCAC
TTCGCCGTCG TCGACGAGGT CGACTCGATC CTCATCGACG AGGCCCGCAC GCCGCTGATC
ATCAGCGGCC CGGCCGGCGA CCCGGCGATG CACCGCTGGT ACACCGAGTT CGCGCGGCTG
GCCCCGATGA TGCAGCGCGA CGTCCACTAC GAGGTGGAGG AGGGCAAGCG CACGGTCGCC
ATCACCGAGG AGGGCGTGGA GTTCGTCGAG GACCAGATCG GCATCGAGAA CCTCTACGAG
GCGGCCAACA CCCCGCTGAT CAGCTTCCTG AACAACGCGC TCAAGGCCAA GGAGCTCTAC
CACCGCGACC AGCAGTACAT CGTCAGCAAC GGCGAGGTGC TCATCGTCGA CGAGTTCACC
GGCCGCGTGC TGTCGGGCCG GCGCTACAAC GAGGGCATGC ACCAGGCCAT CGAGGCCAAG
GAGCGGGTGC AGATCAAGGA CGAGAACCAG ACCCTCGCCA CGATCACCCT GCAGAACTAC
TTCCGGCTCT ACGAGAAGCT GTCCGGGATG ACGGGCACCG CCCAGACCGA GGCCGCCGAG
CTCTCGCAGA CCTACGGGCT GGGCGTCGTC CCGATCCCCA CCAACCGGCC GATGGTCCGC
GAGGACCGCT CCGACGTCAT CTACAAGACC GAGCAGGCCA AGTTCGACGC CGTCATCGAC
GACATCGCCG AGCGGCACGA GGCCGGGCAG CCGGTGCTGG TCGGCACGGC CAGCGTCGAG
AAGTCCGAGC TGCTGTCCAG GCTGCTGCTG CAGCGCGGCA TCAAGCACGA GGTGCTCAAC
GCGAAGAACC ACGCGCGCGA GGCGCACATC GTGGCCCAGG CCGGCCGGCT GGGCGCGGTC
ACGGTGGCCA CCAACATGGC CGGCCGCGGC ACCGACATCC AGCTCGGCGG CAGCCCCGAC
TTCATCGCCG ACGAGGCGCT GCGCGCCCGC GGCCTGTCCC CGGCGGAGAC GCCGGAGGAG
TACGAGGCGG CCTGGGACAG CGCGCTGGAG AAGGCCAGGG ACCAGGTCAA GGCCGAGCAC
GAGGAGGTCA CCGCGGTCGG CGGCCTCTAC GTGCTGGGCA CCGAGCGGCA CGAGAGCCGG
CGCATCGACA ACCAGCTGCG CGGCCGCTCG GGCCGCCAGG GCGACCCGGG TGAGTCCCGG
TTCTACCTGT CGCTGGGCGA CGACCTCATG CGGCGGTTCA ACGGCCCGAT GCTCGAGTCG
ATGATGACCA CGCTGCGGGT CCCCGATGAC CAGCCGATCG AGTCGAAGAT GGTCAGCCGG
GCGATCCTCT CGGCACAGAC CCAGGTCGAG CAGCAGAACT TCGAGGTCCG CAAGGACGTC
CTGAAGTACG ACGAGGTGCT CAACCGCCAG CGCACCGTCA TCTACGCCGA GCGGCGCAAG
GTGCTCGACG GCCAGGACCT GCACGTGCAG GTCCGTTCGA TGGTCGACGA GGTCGTCAGC
GCCTACGTCG ACGGGGCGAC CGAGATGGGT TACGCCGAGG ACTGGGACCT CGAGCAGCTG
TGGACCGGCC TCAAGGCCCT CTACCCGGTG GGCCTGGACC GCGACGAGCT GATCGACCGG
GTGGGCGACG GCGACCAGGC GGCGCTGACC GCCGACGTCC TCAAGAGCGA GCTGCTCGAC
GACGTCCACC GGGCCTACGA GGAGCGCGAG GCCACCCTCG GCGCGGAGGT CATGCGCGAG
CTGGAGCGCC GGGTGCTGCT GTCGGTGCTC GACCGCAAGT GGCGCGAGCA CCTCTACGAG
ATGGACTACC TGCGGGCCGG CATCCACCTG CGCGCGATGG CCAACCGCGA CCCGGTCGTG
GAGTACCAGC GCGAGGGCTA CGACATGTTC GTGTCGATGC TCGACGGCAT CAAGGAGGAG
TCGGTCGGCT TCCTGTTTAA CCTGGAGGTC AAGACCAAGG AGCAGCAGGA CGCCGAGGCC
CGCGCCAAGC AGGCCGAGGC CGAGGCCAAG GCGCTGGCCG TGGCACAGCA GGGGACGGCG
CGGGTGCGGG CCCGGCAGGC GGCGGCCGCG CAGGCCGCGG CGGCGCAGGC GGCCGCGGCG
GCTCCGGCTC CCGCTGCCCC GGCTCCGGCT CCCGCTGCCC CGGCTCCCGC CGCCCCGGCC
GCGGCACCGC CGGCTCGACA GGCCCTCGCC GAGCCGGACC TCGCCGAGTC GGCTCCTGCT
GCGCCCGCTC CCGCGCCGGC GCAGCCGGCC CCGGCGCCCG TCGAGCCGGC TCCCGCCGTG
GCCCAGGCGA CTGCGGCGCC GGTCGAGGTG GTCCCGGCGC CGGTCGAGGG CGACGGCGTC
GCGCCGGCCG CCGAGCCCGT CCCGGCGGCC CGTCGCACGG GGACGGCCCG GCGCGGTCGG
CACGCCGCGC CGGAGGACGT CGCCCCGGCA GCACCCGAGG TCGAGGAGCC GGTGTCCTCG
GGCACCGGTC CGGAGCTGTC GGTGAAGGGT CTCGACGACC CGCACCGCAG CGACCAGCTC
AGCTACTCGG CGCCTGGCCT GGACGCCTCG CCGCGGGAGA GCGGGCGGGT CAAGGCGGCG
AAGAGCGCGA CCGTGACCGG CACCAAGGAG CCGGCCCGGA ACGCGCCCTG CCCCTGCGGC
TCGGGCAAGA AGTACAAGGT CTGCCACGGA GCGCCCTCGC GCGCCTGA
 
Protein sequence
MFSKILRAGE GKILRRLNKI ADAVESLAEE TADLTDPELR ARTDEFKERL AEGETLDQLL 
PEAFAVVREA ATRTLGQRHF RVQVMGGAAL HLGNIAEMRT GEGKTLTGVL PAYLNALTDQ
GVHVVTVNDY LAKRDAEWMG RVHRFLGLSV GVILSGERPA QRREQYACDI TYGTNNEFGF
DYLRDNMAWN KSDLVQRGHH FAVVDEVDSI LIDEARTPLI ISGPAGDPAM HRWYTEFARL
APMMQRDVHY EVEEGKRTVA ITEEGVEFVE DQIGIENLYE AANTPLISFL NNALKAKELY
HRDQQYIVSN GEVLIVDEFT GRVLSGRRYN EGMHQAIEAK ERVQIKDENQ TLATITLQNY
FRLYEKLSGM TGTAQTEAAE LSQTYGLGVV PIPTNRPMVR EDRSDVIYKT EQAKFDAVID
DIAERHEAGQ PVLVGTASVE KSELLSRLLL QRGIKHEVLN AKNHAREAHI VAQAGRLGAV
TVATNMAGRG TDIQLGGSPD FIADEALRAR GLSPAETPEE YEAAWDSALE KARDQVKAEH
EEVTAVGGLY VLGTERHESR RIDNQLRGRS GRQGDPGESR FYLSLGDDLM RRFNGPMLES
MMTTLRVPDD QPIESKMVSR AILSAQTQVE QQNFEVRKDV LKYDEVLNRQ RTVIYAERRK
VLDGQDLHVQ VRSMVDEVVS AYVDGATEMG YAEDWDLEQL WTGLKALYPV GLDRDELIDR
VGDGDQAALT ADVLKSELLD DVHRAYEERE ATLGAEVMRE LERRVLLSVL DRKWREHLYE
MDYLRAGIHL RAMANRDPVV EYQREGYDMF VSMLDGIKEE SVGFLFNLEV KTKEQQDAEA
RAKQAEAEAK ALAVAQQGTA RVRARQAAAA QAAAAQAAAA APAPAAPAPA PAAPAPAAPA
AAPPARQALA EPDLAESAPA APAPAPAQPA PAPVEPAPAV AQATAAPVEV VPAPVEGDGV
APAAEPVPAA RRTGTARRGR HAAPEDVAPA APEVEEPVSS GTGPELSVKG LDDPHRSDQL
SYSAPGLDAS PRESGRVKAA KSATVTGTKE PARNAPCPCG SGKKYKVCHG APSRA