Gene Information |
Locus tag | Gbro_1476 |
Symbol | |
ID | 8550823 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gordonia bronchialis DSM 43247 |
Kingdom | Bacteria |
Replicon accession | NC_013441 |
Strand | - |
Start bp | 1537254 |
End bp | 1539014 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | HNH endonuclease |
Protein accession | YP_003272647 |
Protein GI | 262201439 |
COG category | |
COG ID | |
TIGRFAM ID | |
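The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated in it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon (assumed)."""
    codons, remainder = divmod(length_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1


# Values copied from the Gene Information table.
length = gene_length_bp(1537254, 1539014)
print(length)                     # 1761
print(protein_length_aa(length))  # 586
```

Both results match the Gene Length (1761 bp) and Protein Length (586 aa) rows.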
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0266278 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTTCGAAG CAGGCCGGAA CACCGACGAG GCCACCCTGG CCTGCGCCGG TGAGTCCGAG CTGGCTGCTT CAGCCGCGGC GTTCACCCGC CTGGAACGCC AAGCCATGGC CCGCAAAATC CTTGCCGCCT ACCACCTCGG ACGCAAAGTG TTCGACCGCC ACATCGCCGC CGACTACGAC CGCTACCGCA AAAAGATGCC CGACAAAGCC GCCGTCCGCG ACGTTGCCTG GCACTTCGAC GTCCCCGCCG CCACCGCCGG ACGCTGGCTC GCCCTCGGGG AGCTGCTTTC CCACCTCCCC AACGTGCGGG CCGCATTCCT GGCCGGTGAC CTCACCGAAT CCCGCGCCTC ACTGATCGCG CACGCCCTGA CCGCGTTGGA TGAGAATCAG TACGACGCCG CCGAAGACGC CGCCCTCGGC TACGCCGCCG AAGCCACCAC CAACAGCGTG CTCTCCGAAC TGCTCGACGA GATGGTCATC GCGCTCGACC CCGACCGCGC CGAACAAGCC CGCCACGCGT ATGCCGCCCG CACCCAAGAC GTCCGATTCC GCAAAGACCT CTACGGACAC GTCAACATCT CAGCCACCCT CGACATCGGC GACGCCCGCA CCCTACAAGC CCGGCTCGCC GCGATGATCG AGCACTTCCT GTGCCGCAAC GACACCCGCC CACCCGGCCA ACAACGCGCC GACGCGCTCG CCGCCTTCAT CCACGACCGC ACCACCCTCG ACTGCCGCTG CGGCGACCCC CACTGCACAG CACCCGCCAA CCAACCACCC AACGACGACG AACCCGAACC CGGCACCCCC ACCAGCGATG CCAACGGTGA CGGCGCCGAG GACGCTCACA GCGACGAGGA CATCAGCAAC GAGGGCGTCA GCGACGCCGA AGACGCCGAC GGCAACGACG GTCACCGTGA GGACAGCGAT GGTGACGGTG ACGACAGCGA CGGATGCGTC GTCGATGCCG ATATCGACGC TGACGCAACC ACTCTCGACA GCGACGTGGA TGCACCACTG CCGGAGGAAC CCGACCAGCC CGACGACCCC GAACCCGAAC CCCAGCCCTC GACCGAGATC GCGCCGAGGC CGGTACGAGT CGTCGTCGAA GTCGTCACCG ACGCCCCCAC CCTGGCCGGG CTGACCGGCG ACATGGTGCC CTACATCAAA GGCTACGGCG CCATCGATCC CGCCCATGCC CGCGAACTCG CCGCCAACGG CACCTGGCAA GGCATGTACC GCGAAAGCCA ACGCTTCGCC CACCACACCG ACCCCACCCA ACCCGACCAG ACCTTCGGCC TGCTGAAACC CGGCCGGACC CGCACAGCCG GAACCATCAT CGTGCCCACC CACCTCACCA CCACCGACGG CGGCACCGGC AGCATCGTCG TACGCTCACG CCCACCCAAC CCCCAACCCA TCGACACCAC CGGACACGGT GGACTCACCA CACCACCACC CGGCGCCCTC GTGTATGCGC CTGCGGAGGT ACTGCGGCGC ACCATCGCCC ACACCGACCA CCACTGCCGC GGACCCTACT GCGGCCGACC CGTGGATCAA TGCCACCTCG ACCACATCGT GCCCTTCAAC CACCAAGACC CCCTCGCAGG TGGCTGGACC ATCGCCGAGA ACCTCCACCC CCTGTGCATC CCCTGCCACG AATTCAAACA CCTCGGCATC TGGACACCCA CCATGGCCAC CGGCCGGACG ATCATCTGGC GACACGTCGA ATCCGGCCAG ATCATCATCA CCTACCCGTG A
Protein sequence | MFEAGRNTDE ATLACAGESE LAASAAAFTR LERQAMARKI LAAYHLGRKV FDRHIAADYD RYRKKMPDKA AVRDVAWHFD VPAATAGRWL ALGELLSHLP NVRAAFLAGD LTESRASLIA HALTALDENQ YDAAEDAALG YAAEATTNSV LSELLDEMVI ALDPDRAEQA RHAYAARTQD VRFRKDLYGH VNISATLDIG DARTLQARLA AMIEHFLCRN DTRPPGQQRA DALAAFIHDR TTLDCRCGDP HCTAPANQPP NDDEPEPGTP TSDANGDGAE DAHSDEDISN EGVSDAEDAD GNDGHREDSD GDGDDSDGCV VDADIDADAT TLDSDVDAPL PEEPDQPDDP EPEPQPSTEI APRPVRVVVE VVTDAPTLAG LTGDMVPYIK GYGAIDPAHA RELAANGTWQ GMYRESQRFA HHTDPTQPDQ TFGLLKPGRT RTAGTIIVPT HLTTTDGGTG SIVVRSRPPN PQPIDTTGHG GLTTPPPGAL VYAPAEVLRR TIAHTDHHCR GPYCGRPVDQ CHLDHIVPFN HQDPLAGGWT IAENLHPLCI PCHEFKHLGI WTPTMATGRT IIWRHVESGQ IIITYP
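The sequences above are printed in space-separated blocks of ten characters, so the spaces must be stripped before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation, shown on the first three blocks of the gene sequence only. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the method used by the database to derive its GC content field is not stated in this record.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three blocks of the gene sequence, copied from the record.
first_blocks = "ATGTTCGAAG CAGGCCGGAA CACCGACGAG"
seq = clean_sequence(first_blocks)
print(len(seq))          # 30
print(gc_fraction(seq))  # 0.6
```

Applying the same two calls to the full gene sequence gives a value that can be compared against the GC content row of the Gene Information table.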