Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gbro_2802 |
Symbol | |
ID | 8552166 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gordonia bronchialis DSM 43247 |
Kingdom | Bacteria |
Replicon accession | NC_013441 |
Strand | + |
Start bp | 2989382 |
End bp | 2991037 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | protein of unknown function DUF222 |
Protein accession | YP_003273910 |
Protein GI | 262202702 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0845411 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCGACC AGCCACCATT ACCACCACAG CTCACCGACC TGATCGCCCA ACTCCACGCC CTCACCGACG AACTCCAGCA GGTGGACCTC ACCGCCTGCT CCGACGATGA CCTCATCGCT GCCGCCAACG CGCACGAACA AGCCATCACC CGATTGACCT ACGCCGGTGA CCGGCAACTG GTGGAGATCA CCGCGCGGGA TCTGCCGCGG CAGATGGGCT ACCGCTCGGT GCCGAACTTC TTCAACCATC GTCTGCGGAT CTCCAACCCG CAACGCCGCC GCACCCAACT GGCGGCCACC GCCACTCAAC GATCGTTGAC CGGCGACACC ACCGAACCGA GGTTTCCGGT GCTCGCCGAG GCGTTCGCCG CCGGCACCGT CGGCACCGGG CACATCACCA CGGTGCTCGA TGTGCTCGAC CAGATCCCCG CGTCGGTCCC CTTCGACAAG AAGACCGCGG CCGAGCGGCA GATGGTCGAC ATCGCCACCG AGTTCGCGCC TGCCGAGATC GCGATGGCCG GTCAACGCCT GTTGGGCCAT CTCGATCCGG ACGGGTCGCT GACCGGTGAT GCTGATCGGG CGCGGCGCCG CGGAGTGTGG ATCGGCAAAC CGCGTACTGA TGGGACCTCG CATCTGTCGG GCACCCTTAC CCCGGAATTG GCGGCCCGAC TGTCGATGAT GATGGCCGTG TTCGGCCAAC CCGGCCTCAA CAACCCCGAC GATCCCGACG CGCCGAACGG GGCCTACGAG AACGCTGATG CCGACCACGT CGCCGACGCC GCCTGCCGTG ACTACCGCAC CCCCACCCAA CGCAACCACG ACGCCCTCGA TGCGGCCCTG GAAGCCATGT TTGCCGACGG GACGCTGGGC ACCACCCACC GCGGTCTGCC GGTGCAGTTG ATCATCAAAG CCGACCTGAG CGATCTGATC GCCGAGACCG GGTACGGTGT CACCGCCACC AACACCCTGC TACCGATGAC CGATGTCATC CGGTTGGCCG CGCAGGCACA GCCGTGGTTG GCGGTGTTCG AGGACGCCAC CCCGATCCCC CTGTTCTTCG GGAAAGGCAA ACGGTTCGCC ACCCAAGCCC AACGCATGGT CAACTTCGCC CGCCCCGACG GACACGTCTG TTCCGCCCAC GGCTGTGATC AGCCGGCGGC GTATCTGGAA CTGCATCACG CCCAACTGGA CTGGGCCGAC GGCGGACTCA CCGACATCAT CGACATGACC GGCGCGTGTC CCAAACACAA TCGGATGGTC GGACCCAACC CCGGCCAATA CACCACCCGC ATGATCGGCG ACGGCCCCGA CCGCGGACGC TGCGGCTGGA CCCTCAACAC CCGCCCCGGC GCACCACCCA ACCCCGAACG GGTCAACCGC ACCCCCGACC TGGCCGCCGG ATTCACCAAA CACCTCGCGC AGGTACGCGC CGAAATCCAC GGCCCACCTG GCAATGGGCC GACGACCGAC GACCCGACAA CCGATGTTCC CGACATGGAC ACCGGACGAT CAGCGCTGCC CACTGAACAA GGCGCGATCG ACCAACAGGT CCGCGAACTC GCCCGCCTCC AGCTCCGTCA GACCATCAAC CCCGACTCCG TCGTCGAGAC CCGGCTCGCC GCACTCCTCG AAACGCACCT CGGACTCAAC AACTGA
|
Protein sequence | MSDQPPLPPQ LTDLIAQLHA LTDELQQVDL TACSDDDLIA AANAHEQAIT RLTYAGDRQL VEITARDLPR QMGYRSVPNF FNHRLRISNP QRRRTQLAAT ATQRSLTGDT TEPRFPVLAE AFAAGTVGTG HITTVLDVLD QIPASVPFDK KTAAERQMVD IATEFAPAEI AMAGQRLLGH LDPDGSLTGD ADRARRRGVW IGKPRTDGTS HLSGTLTPEL AARLSMMMAV FGQPGLNNPD DPDAPNGAYE NADADHVADA ACRDYRTPTQ RNHDALDAAL EAMFADGTLG TTHRGLPVQL IIKADLSDLI AETGYGVTAT NTLLPMTDVI RLAAQAQPWL AVFEDATPIP LFFGKGKRFA TQAQRMVNFA RPDGHVCSAH GCDQPAAYLE LHHAQLDWAD GGLTDIIDMT GACPKHNRMV GPNPGQYTTR MIGDGPDRGR CGWTLNTRPG APPNPERVNR TPDLAAGFTK HLAQVRAEIH GPPGNGPTTD DPTTDVPDMD TGRSALPTEQ GAIDQQVREL ARLQLRQTIN PDSVVETRLA ALLETHLGLN N
|
| |