Gene Gobs_4410 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_4410 
Symbol 
ID8756104 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp4644226 
End bp4645560 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content72% 
IMG OID 
Productprotein of unknown function DUF21 
Protein accessionYP_003411336 
Protein GI284992782 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.938013 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCGACG TCTGGCTCAA CATCCTGATG GTCGTCGTCT TCGTCCTGAT AGGCGGCGTC 
TTCTCGGGGG CGGAGATCGC CCTGGTGTCC CTGCGCGAGT CGCAGGTGCG CGCGCTGGCC
GAGTCGGGGG GACGCCGCGG CCAGGCGGTG CAGCGGCTGC TCAGCGACCC CAACCGCTTC
CTCGCCGCCG TCCAGGTCGG CGTCACCCTG GCCGGCTTCT TCTCCGCCGC GTTCGGTGCC
AGCACGCTGT CCCAGCCGCT CGGCGAGTGG TTCATCACCC TCGGCATGCG CGCCGGGCTG
GCCGACCCGC TGGCCTTCGT GCTGGTCACC ATCGCGATCA GCTACCTGTC CCTGGTGGTC
GGCGAGCTGA CCCCCAAGCG CCTGGCGCTG CAGCGTGCCG AGGGCTTCTC CCTGCTCGTC
GCCGCGCCGC TCAACGCGAT CGCCAAGCTG TCGCGCCCGG TCATCTGGCT GTTGTCGAAG
TCGACCAACC TGCTCGTCCG GCTGGTGGGC GGGGACCCGA CCGCCAGCGG TGAGTCGATC
AGCCAGGAGG AGCTGCGCGA CCTGGTCACG GCGCACGAGT CGCTGAGTTC CGACGAGCGC
CGGCTCATCG GCGAGGTCTT CAGGGCCGGC GACCGCGAGG TGCGCGAGGT CATGACCCCG
CGCACCGAGG TGGACTTCCT CGACGCGTCG ATGACCGCCA GCCGGGCCGC CAAGCAGGTG
CACGACTCCA GCCACTCCCG CTACCCGGTC GTCGGCCGCG ACGAGGACGA CGTCCTGGGC
TTCGTGCACG TCCGCGACCT GTTCCTGCCC AACCACCCGG CCGGGCGCGC GGCGACCGTC
GGCGACCTGG TCCGCGAGGT CAAGCGGCTG CCGGGCACCG CCGGCGTCCT CACCGCGCTG
TCGGAGATGC GGCGGGAGAA CCAGCACCTG GCGATCGTCG TCGACGAGTA CGGCGGCACC
GACGGGATCG TCACCCTCGA GGACCTCATC GAGGAGGTCA TCGGGGAGAT CTACGACGAG
TACGACGAGG GCGTCGCCGA CGGCGGGGAC GAGCGGCCGG ACGGCCCGCA GGAGCTCGAC
GGGCTGCTCA ACCTCGACGA CTTCCGCGAG GCGACCGGCC TGCAGCTGCC CGAGGGGCCC
TACGAGACCG TCGCCGGCTA CGTGCTCGCC GAGCTCGGCC GGCTGCCCGT CGTCGGCGAC
AGCGTCGAGG TCGAGGGGCG CACGCTCACC GTCCTGGAGC TCGACGGACG GCGGATCGCG
CGGATCTCGG TCAGCCGCGC CCCGCAGCCC GAGGTCGACC CGTCCCAGGT GCCGACCACC
ACGATCGGCA CCTGA
 
Protein sequence
MSDVWLNILM VVVFVLIGGV FSGAEIALVS LRESQVRALA ESGGRRGQAV QRLLSDPNRF 
LAAVQVGVTL AGFFSAAFGA STLSQPLGEW FITLGMRAGL ADPLAFVLVT IAISYLSLVV
GELTPKRLAL QRAEGFSLLV AAPLNAIAKL SRPVIWLLSK STNLLVRLVG GDPTASGESI
SQEELRDLVT AHESLSSDER RLIGEVFRAG DREVREVMTP RTEVDFLDAS MTASRAAKQV
HDSSHSRYPV VGRDEDDVLG FVHVRDLFLP NHPAGRAATV GDLVREVKRL PGTAGVLTAL
SEMRRENQHL AIVVDEYGGT DGIVTLEDLI EEVIGEIYDE YDEGVADGGD ERPDGPQELD
GLLNLDDFRE ATGLQLPEGP YETVAGYVLA ELGRLPVVGD SVEVEGRTLT VLELDGRRIA
RISVSRAPQP EVDPSQVPTT TIGT