Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5695 |
Symbol | ureC |
ID | 5674021 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6913101 |
End bp | 6914828 |
Gene Length | 1728 bp |
Protein Length | 575 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641244548 |
Product | urease subunit alpha |
Protein accession | YP_001509951 |
Protein GI | 158317443 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0804] Urea amidohydrolase (urease) alpha subunit |
TIGRFAM ID | [TIGR01792] urease, alpha subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGTGAGC TGGAGCGGTC CCGGTACGCC ACCCTGTACG GGCCGACGGT GGGCGACCGG ATCCGCCTCG CGGACACCGA CCTGTTCATC GAGGTCACCG ACGACCTCAG CCGCGGCCCG GGCGGCACGG CCACCGGCGA CGAGGCGGTG TTCGGCGGCG GCAAGGTCAT CCGCGAGTCG ATGGGCCAGG CCCGCGCCAC CCGCGCGCAG GGCGCACCCG ACCTGGTGAT CACCGGTGCG GTCGTGCTCG ACCACTGGGG AGTCGTCAAG GCCGACGTCG GCATCCGGGA CGGACGGATC AGCGCGCTGG GCAAGGCCGG CAACCCCGAC ACCATGGACG GCGTCCACCC GGATCTGGTG ATCGGCCCCG GCACCGAGAT CATCGCGGGC AACGGGAAGA TCCTCACCGC GGGCGCGGTC GACACCCACG TCCACTTCAT CTGCCCGCAG CAGGTTCCCG AGGCCCTCGG CACGGGCGTC ACCACGCTCA TCGGCGGCGG CACCGGGCCG GCGGAGGGCA CCAAGGCGAC GACCGTCACC GCGTCGCCGT GGAACCTGCA CCGGATGATG TCCGCGATGG ACGGCTGGCC GGTCAACGTC GCCCTGCTCG GCAAGGGCAA CACGGTCAGC GAGGACGCGA TGTGGGAACA GCTGCGCGCC GGCGCGGCCG GCTTCAAGCT GCACGAGGAC TGGGGCACCA CCCCGGCGGC CATCGACGCC TGCCTGCGCG TCGCCGACGC CTCGGGGGTA CAGGTGGCAC TGCACTCCGA CACCCTCAAC GAGGCCGGGT TCGTCGAGGA CACCCTCGCC GCGATCGCCG GCCGGGCGAT CCACGCCTAC CACACGGAGG GTGCCGGCGG CGGGCACGCG CCGGACATCA TCACCGTCGC CTCGTTCGCG AACATCCTGC CGTCGTCGAC GAACCCCACC CGGCCGCACA CGGTCAACAC CCTCGACGAG CACCTCGACA TGCTGATGGT CTGCCATCAT CTGAACCCGT CGGTGCCCGA GGACCTGGCG TTCGCGGAGA GCCGCATCCG GCCGTCCACG ATCGCGGCCG AGGACATCCT GCACGACCTG GGCGCGATCT CGATGATCGG CTCGGACTCG CAGGCGATGG GCCGCGTCGG CGAGGTCGTC ACGCGGACCT GGCAGACCGC GCACGTCATG AAACGCCGCC GCGGCGCACT GCCCGGCGAC ACGGTCGCCG ACAACAACCG GGCCCGGCGC TACGTCGCCA AGTACACGAT CTGCCCGGCG GTGGCGCACG GCCTGGACGC CGAGATCGGG TCGGTGGAGG CCGGGAAGTT GGCCGACCTG GTGCTCTATG AGCCGGCCTT CTTCGGGGTG CGGCCGTCGC TGGTCCTCAA GGGCGGTTTC ATCGCGTGGG CGGCGATGGG CGACGCGAAC GCCTCGATCC CGACCCCGCA GCCGGTGCTG CCGCGGCCCA TGTTCGGCGC CGCCCCCGGC CCGGCCGCGG CGAGTTCGCT GATGTTCGTC GCGCCCGCGG CGCTGCAGGA CGGCCTCGAC GAGCGGCTGG GCCTGGCGAA GCCGATGGTC GCCACGGCGG ACGTCCGCCG GCGGGGCAAG GCGGACCTGC CGGAGAACAC CGCGACGCCG GACATCCGCG TCGACCCGGA CACCTTCACC GTGCGCATCG ACGGCGAGGC GGTGGAGGCG GCTCCGGCCG CCGAGCTGCC CATGGCCCAG CGGTATTTCC TCTTCTGA
|
Protein sequence | MSELERSRYA TLYGPTVGDR IRLADTDLFI EVTDDLSRGP GGTATGDEAV FGGGKVIRES MGQARATRAQ GAPDLVITGA VVLDHWGVVK ADVGIRDGRI SALGKAGNPD TMDGVHPDLV IGPGTEIIAG NGKILTAGAV DTHVHFICPQ QVPEALGTGV TTLIGGGTGP AEGTKATTVT ASPWNLHRMM SAMDGWPVNV ALLGKGNTVS EDAMWEQLRA GAAGFKLHED WGTTPAAIDA CLRVADASGV QVALHSDTLN EAGFVEDTLA AIAGRAIHAY HTEGAGGGHA PDIITVASFA NILPSSTNPT RPHTVNTLDE HLDMLMVCHH LNPSVPEDLA FAESRIRPST IAAEDILHDL GAISMIGSDS QAMGRVGEVV TRTWQTAHVM KRRRGALPGD TVADNNRARR YVAKYTICPA VAHGLDAEIG SVEAGKLADL VLYEPAFFGV RPSLVLKGGF IAWAAMGDAN ASIPTPQPVL PRPMFGAAPG PAAASSLMFV APAALQDGLD ERLGLAKPMV ATADVRRRGK ADLPENTATP DIRVDPDTFT VRIDGEAVEA APAAELPMAQ RYFLF
|
| |