Gene Gobs_4224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_4224 
Symbol 
ID8755918 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp4436964 
End bp4438331 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content76% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003411157 
Protein GI284992603 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGATG TCCCGTTCGG CTTCGGTGTC CCCGACCGCG ACCCCGAGCG TCGCGACCAG 
TCCGGGTCAG GCCCCGGGAA CGACCCCTTC GGATTCGGCG CTCTCTTCGG TGGCGCCGGT
GGGGGGACGC CGGACGAGCT GCTCGCCAAG ATGCCGCTGT TCGCCGAGCT GCAGAAGCTG
ATGACCTGGT CCGGCGGCCC GGTCAACTGG GACCTGGCGC GGCAGGGGGC GATCAGCTCG
CTGGCCGCCG GTTCGCAGCC GTCCTCCGAC GCCGAGCGCG CCGCCGTCGC CGATGCTCTG
CGCCTGGCCG ACCTGTGGCT CGACCAGGTC ACCGAGCTGC CCTCCGGCGT GGACCGGCCG
CTCGCCTGGT CCCGCGTGGA GTGGGTGGAG CAGACGCTGC CCGCCTGGAG CACCCTCATC
GACCCGCTCG CCGAGCGCGT CGTCGGCGCC ATGACCAGCG CCCTCCCCGC CGAGGCGGCC
GCGATGGCCG GCCCGCTCGC CGGGATCATG GGCCGGATGG GCGGCCTGAT GTTCGGCGCC
CAGGTCGGCC AGGCGCTCGG CCGGCTGTCC GGCGAGGTCC TCACCAGCGG CGAGATCGGC
ATCCCGCTGG CCCCGGCCGG CGCCGGCGTC CTGCTGCCGC AGAACGTCGC CGAGTTCGCG
GCCGGCCTCG ACCGCCCCGC CGACGAGGTG CGACTGTTCC TCGCGTTGCG CGAGGCGGCC
TCGCAGCGGC TGTTCGTGCA CGTGCCGTGG CTGCGCCAGC AGCTGCACGA CGCCGTCCAC
GCGTACGCGC GCGGCATCCA CGTCGACCGC GAGGCGATCG AGCGCGGCAT CAACGAGGCG
ATGGGTTCGA TGGGCGGGAT CGACCCGACC AACCCCGAGG GCATCCAGGC GCTGCTGGGC
AGCGGGCTGC TGGAGCCCGA GGAGACCCCC GAGCAGCAGG CGGCGCTGCG CCGGCTGGAG
ACGCTGCTCG CGCTCGTCGA GGGCTGGGTC GACAGCGTGG TGGCCGCGGC CGCCGGCGAC
CGGCTGCCCG GGCACGGAGC GCTGGCCGAG ACGATGCGCC GCCGTCGCGC CTCCGGCGGG
CCGGCCGAGC AGACCTTCGC GACCCTGGTG GGCCTGGAGC TGCGGCCGCG GCGGCTGCGC
GACGCCGCCA CCGTGTGGGG CGCGATGGCC CAGCAGCACG GCAACGCCGA GCGCGACCGG
CTGTGGTCGC ACCCGGACCT GCTGCCGACG TCGGACGACC TGGACGAGCC GCTCGACTTC
GTCGCCCGCC AGGGTGCGGA CGACGAGCTG CGCAGCCTCA CCGCCGACGA CGCCCAGGAG
CCCGGCACCC AGAAGCCCGA CACCGACGGC CGCGACAGCG GGGACTGA
 
Protein sequence
MSDVPFGFGV PDRDPERRDQ SGSGPGNDPF GFGALFGGAG GGTPDELLAK MPLFAELQKL 
MTWSGGPVNW DLARQGAISS LAAGSQPSSD AERAAVADAL RLADLWLDQV TELPSGVDRP
LAWSRVEWVE QTLPAWSTLI DPLAERVVGA MTSALPAEAA AMAGPLAGIM GRMGGLMFGA
QVGQALGRLS GEVLTSGEIG IPLAPAGAGV LLPQNVAEFA AGLDRPADEV RLFLALREAA
SQRLFVHVPW LRQQLHDAVH AYARGIHVDR EAIERGINEA MGSMGGIDPT NPEGIQALLG
SGLLEPEETP EQQAALRRLE TLLALVEGWV DSVVAAAAGD RLPGHGALAE TMRRRRASGG
PAEQTFATLV GLELRPRRLR DAATVWGAMA QQHGNAERDR LWSHPDLLPT SDDLDEPLDF
VARQGADDEL RSLTADDAQE PGTQKPDTDG RDSGD