Gene Gobs_4102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_4102 
Symbol 
ID8755793 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp4313233 
End bp4314627 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content73% 
IMG OID 
Productputative pep2 protein 
Protein accessionYP_003411038 
Protein GI284992484 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.938013 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGCCC TGACCGGACT GTTCCGGGAC TGGATGCCGT CCCAGCGCTG GTTCGGTGGC 
AAGGGCCGGG AGTGGGCCGG CGTCGAGGAG GAGAGCTTCT TCCTCGACCG CTCGCACCCC
GTCCTGTCCA TCCACCGGGT GACGGTGACC TACACCGACG GCGCGACCGA GACCTACCTG
GTGCCGCTGT CGTGGCGTGA CCACGCCGTG GAGGAGCTGG CCTTGGCGCA CATCGGCACC
GTCGCGCACG AGAACGGCGA GAACCACGCC TACGACGCCA TGCGCGACCG CGAGGCCACC
GCCAGCTGGC TCACCCACCT CGCCGACGGC GCCACGGTGG GCCCGATGCG GTTCGTGCCG
GGCGAGGGTG CCGAGATCCC GGCCGGGCTC CCCGGCGACA TCGTCTCCAC CGAGCAGAGC
AACACCTCGC TGGTGTATGG GCAGGAGGCC ATCCTCAAGC TGTTCCGGCG GCTCGAACCC
GGCCTCAACC CGGACGTCGA GGTCCACAGC GCCCTGCGCC GGACCGACAA CCCGCACATC
GCCCCGCTGC TCGGTCACGC CGAGATCGAC CACGACCGGG ACGCCGGCAC CCCGCCGGCG
ACGGTCTTCA TGCTGCAGCG GTTCGTGCCC AACGCCAGCG ACGGGTGGCT GCTGGCCACC
GCGAGCGTGC GCGACCTCTA CGCCGAGGGC GACCTGCACG CCGACGAGGT GGGCGGGGAC
TTCGCCGCCG ACAGCGAGCG GCTCGGCGCG GCCACCGCCT CGGTGCACGC CGACATGGCG
CAGGTGCTGC CGACCGAGGA GGCCGACCGC GACTGGTTCA CCACCGTCGC CCGGCAGATG
ACCGAGCGGC TCGACGCCGC GATCGAGGTC GTCCCGCAGC TGGCCGAGCA CGCCGACGGG
TTGCGTGCGG TGTACGCGGC CGTGGCGGAG AACCCCGAGC CCGTCGTCCG CCAGCGGGTG
CACGGGGACC TGCACCTGGG CCAGGTGCTG CGCACCGCCA CCGGGTGGAT CGTGCTCGAC
TTCGAGGGCG AGCCCGCCCG CCCGCTGGCC GCGCGCCGCG AGCTGGACAG CCCGATGCGC
GACGTGGCCG GGATGCTGCG CAGCTTCGAC TACGCCGCCC GCCACATGCT CGTGGAGCAG
CCGGGTGATC AACAGCGCGC GTACCGCGCG CAGGAGTGGG CAGAGCGCAA CCGGAGCGCG
TTCTGCGCGG GCTACGCGGC CGCGAGCGGG ATGGACGCCT GCGGCAACAG CCCGTTGTTG
CGCGCGTTCG AGGCGGACAA GGCCGTCTAC GAGTGCGTCT ACGAGGCGCG CAACCGCCCG
CACTGGCTGA TGATCCCGCT GCAGTCGCTG TCCCGCCTCA CCGCCGCGGA CCAGCGCGGC
GAGCCACGAC CCTGA
 
Protein sequence
MNALTGLFRD WMPSQRWFGG KGREWAGVEE ESFFLDRSHP VLSIHRVTVT YTDGATETYL 
VPLSWRDHAV EELALAHIGT VAHENGENHA YDAMRDREAT ASWLTHLADG ATVGPMRFVP
GEGAEIPAGL PGDIVSTEQS NTSLVYGQEA ILKLFRRLEP GLNPDVEVHS ALRRTDNPHI
APLLGHAEID HDRDAGTPPA TVFMLQRFVP NASDGWLLAT ASVRDLYAEG DLHADEVGGD
FAADSERLGA ATASVHADMA QVLPTEEADR DWFTTVARQM TERLDAAIEV VPQLAEHADG
LRAVYAAVAE NPEPVVRQRV HGDLHLGQVL RTATGWIVLD FEGEPARPLA ARRELDSPMR
DVAGMLRSFD YAARHMLVEQ PGDQQRAYRA QEWAERNRSA FCAGYAAASG MDACGNSPLL
RAFEADKAVY ECVYEARNRP HWLMIPLQSL SRLTAADQRG EPRP