Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gobs_3948 |
Symbol | |
ID | 8755633 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | - |
Start bp | 4139744 |
End bp | 4140955 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | protein of unknown function DUF1006 |
Protein accession | YP_003410887 |
Protein GI | 284992333 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.751345 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGGAGCAC CCGTTGCCGA CCGGCTGCCG GCCGCGCTGG CCCGCCGCAT CGCCCTGGCC GCGCAGGGCT TCGCCGACCC CCGGCCCGCG GGCGCCGTCG ACGGGCGGCA GCTGCGCCGG ATGACGTCCC GGCTGGCGGT GCTGCAGATC GACTCGGTCA ACGTGCTGTC CCGCGCCCAC TACCTGCCAG CCTTCAGCCG CCTCGGCCCC TATCCGCGCG AGACGCTCGA CGACCTCGCC GACCGCCACA GGGAGCTGTT CGAGTACTGG GCGCACGAGG CCTCGCTGCT CCCCGTCCGG CTGCACCCAC ATCTGCGCTG GCGGATGGCG GCGGCCGAGG AGCACGCCTG GAGCTCGATG GTCCGGATCC GGCGCGAGCG TCCGGGATTC GTCACCGAGG TCCTCGAGCG GGTGCGGGAG ACCGGTCCGC TCAAGGCCAG CGACCTCGCC GAGCCGCGAC CGGACCGGCC CGGGAGCATG TGGAACTGGC ACGCCGGCAA GGTCGCCCTC GAGTGGCTCT TCTACACCGG CGTCGTCACC ACCCGAGGCC GGACGGCGGG CTTCGAACGC GTCTACGACC TCACCGAGCG GGTGTTGCCC GCAGCCGTGC TCCAGGCTCC CACCCCGGAG CCGGCCGACG CCGTCCGCGA GCTGGTGCGC ACCGCCTCGC GTGCCCTGGG CGTGGCCACC GAGCGCGACC TGCGCGACTA CTTCCGGCTG CGTCCACCGG CCGCGCGGGC GGCGATCGCC GAGCTGGCCG ACGCGGGCGA GCTCGTCCCC GTCCAGGTGA CCGGCTGGGG TGCGCCGGCG TGGCTGCACC CCGAGGCACG CCGTCCCCGC TGGGTGCGGG CACGGGCGCT GGTGAGCCCC TTCGACTCCC TGGTCTGGGA GCGGCCGCGG GTGGAACGCA TCTTCGGCTT CCGGTACCGG CTGGAGATCT ACACCCCGGC GGCCCAGCGG GTGCACGGCT ACTACGTCCT GCCGTTCTTG CTCGACGACC GGCTGGTGGC GCGGGTGGAC CTCAAGGCCG ACCGGCATGC CGGGGTGCTG CGGATCCAGT CGGCGTTCGC CGAGGAGGGC GTGGACCGTG CCCAGGTGAC CGCTGCCCTC GCCGAGGAGC TGGCGCTCAT GGCCGGCTGG ATGCAGCTGG GTGCCGTCGT CGCGGGCGAG CGCGGTGACC TGGCTGCCGA GCTCGCCGCC GTCGTGGGCT GA
|
Protein sequence | MGAPVADRLP AALARRIALA AQGFADPRPA GAVDGRQLRR MTSRLAVLQI DSVNVLSRAH YLPAFSRLGP YPRETLDDLA DRHRELFEYW AHEASLLPVR LHPHLRWRMA AAEEHAWSSM VRIRRERPGF VTEVLERVRE TGPLKASDLA EPRPDRPGSM WNWHAGKVAL EWLFYTGVVT TRGRTAGFER VYDLTERVLP AAVLQAPTPE PADAVRELVR TASRALGVAT ERDLRDYFRL RPPAARAAIA ELADAGELVP VQVTGWGAPA WLHPEARRPR WVRARALVSP FDSLVWERPR VERIFGFRYR LEIYTPAAQR VHGYYVLPFL LDDRLVARVD LKADRHAGVL RIQSAFAEEG VDRAQVTAAL AEELALMAGW MQLGAVVAGE RGDLAAELAA VVG
|
| |