Gene Gobs_4021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_4021 
Symbol 
ID8755709 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp4218489 
End bp4220168 
Gene Length1680 bp 
Protein Length559 aa 
Translation table11 
GC content74% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003410957 
Protein GI284992403 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGCTGC CCGGCCGGGA GAACCCGGTC CTCCTCCTCG GTCCCCTGTT GCGGCACGTC 
GACCCGGTGT CGGCCACCGT GTGGGTGGAG ACCGACCGGC CGTGCGAGGT GGAGGTGCTC
GGCCGCCGCG CCCGCACCTT CGGCGTGAGC GGCCACCACT ACGCCCTGGT CGTCGTCGAG
GGGCTGGAGC CGGGCAGCGC CACGCCCTAC GAGGTGCGTC TGGACGGCGA GCAGGTGTGG
CCGGAGGCCT CGTCGGCGTA CCCGCGCAGC CGGATCCGGA CGCCGGGCCG GCCCGGGCCG
TTCACCATCG CCTTCGGCTC CTGCCGCTAT GCCACCCCGA GCACCGTCGA CGCCGACGAG
GGCATCCCGC CCGACGCGCT GGACACCTAC GCCGCGCGGC TGACCGGTCA GCCCGAGGAC
ACCTGGCCCG ACGCGCTGGT GCTGCTCGGC GACCAGGTCT ACGCCGACGA GCTCACCCGC
GCGACCCGCC AGTGGCTCTC CCTGCGCCGG CAGCAACCGA CCCCGCGGGA CGCCCAGGTC
GACGACTTCG AGGAGTACAC CCGGCTCTAC GCCGAGTCCT GGAGCGATCC CCAGGTGCGC
TGGCTGCTGT CGACCGTGCC GTCGTCGATG ATCTTCGACG ACCACGAGAT GATCGACGAC
TGGAACACCT CCGCCGCCTG GCGCCGGAAG GTGACCGGGC AGGACTGGTG GACCCGCCGG
ATCTGCGGGG GCCTGGTCAG CTACTGGGTC TACCAGCACC TGGGCAACCT CAGCCCGCAG
GAGCTGGCCG ACAACGAGAC CTGGCAGGCG GTGCAGGAGC ACCGGGACGA CGCCGAGCCG
GTGCTGCAGG CCATGGCCGA GGCCGCCGAC CGCGACCCCC GCTCGGTCCG CTGGAGCTAC
GTGCGGCACT GGGGCGAGGC GCGGATGATC ATGGTCGACA GCCGGGCCGG CCGGGTGCTG
GAGGAGTCGA CCCGCGGGAT GCTCGACGAG GACGAGTTCG CCTGGGTGGA GGCGGCGATG
CGCCGCGCCG TCGACGAGGG GGTCGAGCAC CTGGTGCTGG GCACCTCGCT GCCCTGGCTG
CTGCCGCACG CCATCCACCA GGTGGAGCGG TGGAACGAGA CGCTCGTCCG GCGGCACCTG
GGCCGCCCCC TGGGCTGGGT CTCCGAACAG CTCCGGCAGG CCGCCGACCT GGAGCACTGG
GCCGCGTTCG GCGACTCCTT CGAGCGGCTG GGCCGCGCCC TGGTCGGGCT GGCCCGGGGC
GAGCAGGGCC GGGCGCCGGC GACCGCGCTG GTGCTCTCCG GCGACGTCCA CCACGCCTAC
GCCGCCGAGC TGGTGCAGCC CGACGGCCTG AGCACCCGGG TGCACCAGCT GACCGTGTCC
CCGCTGCACA ACCAGGCGCC GCACCCGATC CGCGTGGGCT TCCGCATCGG CTGGAGCCGC
TGGGCCCGCC GGCTCACCAC GGGGATGTCC CGCCTGGCCC GGGTCCGCCC CTCCGAGCTG
GACTGGGCCA AGCAGGCCGG GCCGTACTTC GGCAACCAGC TCGGCGAGCT GGTGCTGCAC
GACCGCGACG CCTCGTTCCG GCTGTTCGTC AGCGACCGGG ACGACGGGGG CCAGGGGCGG
CTGCGGCTGG TCGCCGACCT GCCGCTGTCG AACCCGGTCC CGGCGGAGGT GACAGGCTGA
 
Protein sequence
MPLPGRENPV LLLGPLLRHV DPVSATVWVE TDRPCEVEVL GRRARTFGVS GHHYALVVVE 
GLEPGSATPY EVRLDGEQVW PEASSAYPRS RIRTPGRPGP FTIAFGSCRY ATPSTVDADE
GIPPDALDTY AARLTGQPED TWPDALVLLG DQVYADELTR ATRQWLSLRR QQPTPRDAQV
DDFEEYTRLY AESWSDPQVR WLLSTVPSSM IFDDHEMIDD WNTSAAWRRK VTGQDWWTRR
ICGGLVSYWV YQHLGNLSPQ ELADNETWQA VQEHRDDAEP VLQAMAEAAD RDPRSVRWSY
VRHWGEARMI MVDSRAGRVL EESTRGMLDE DEFAWVEAAM RRAVDEGVEH LVLGTSLPWL
LPHAIHQVER WNETLVRRHL GRPLGWVSEQ LRQAADLEHW AAFGDSFERL GRALVGLARG
EQGRAPATAL VLSGDVHHAY AAELVQPDGL STRVHQLTVS PLHNQAPHPI RVGFRIGWSR
WARRLTTGMS RLARVRPSEL DWAKQAGPYF GNQLGELVLH DRDASFRLFV SDRDDGGQGR
LRLVADLPLS NPVPAEVTG