Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_0839 |
Symbol | |
ID | 3905116 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 980608 |
End bp | 981969 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637878172 |
Product | allantoinase |
Protein accession | YP_479952 |
Protein GI | 86739552 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0044] Dihydroorotase and related cyclic amidohydrolases |
TIGRFAM ID | [TIGR00857] dihydroorotase, multifunctional complex type [TIGR03178] allantoinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGAGC CCACCGCGGA CGTTCTGCCG GAGCGGCCAC AGGCGCTACG GTCACGGCGA GTCGTGCTGC CCGGGGGCGA GCGTCCGGCA GCCGTGCACG TCGCGGACGG CCGGATCGCC GCGATCACCG CCGCGGACGA GATCCCTCCC GGGACCCTGG TCACGGACCT CGGCGAGCTG GCTCTGCTTC CCGGCGTGGT CGACTCCCAC GTGCACATCA ACGAACCGGG GCGGACCGAG TGGGAGGGGT TCGCGACCGC GACCCGGGCC GCCGCGGCCG GTGGTGTCAC CACGATCATC GACATGCCGC TGAATTCCGT TCCGCCGACG ACCTCGCTCG CCGCCCTGGC CGCCAAACGT GCGGTGGCGG CCGGCCAGGT CGCCGTCGAC GTCGGCTTCT GGGGCGGCAT CATCGGCGCC GACGCCCGCA ACCTCGATGA TCTGGCCGCC CTGCACGGCG CCGGGGTCTT CGGATTCAAG GCCTTCCTGG CCCCGTCGGG GGTCGAGGAG TTCCCGCACG TCTCGATGGA CGTCCTCGGG GCCGCCGCCC GGCGCACCGC CCGGATGGGG GCCCTCACCG TCGTGCACGC CGAGGCGCCC TCCGTTCTCG CCCTGGCACC TCCCACCGTG GGGCGGGCCT TCGCCAGCTG GTTGGCGTCG CGCCCGCCGG CCGCGGAGAC CGAGGCCGTG GCCGCGCTGG CGGCGCTGTC CGCCGCCAGC GGTGCCCGCC TGCACGTGCT GCATCTGGCG GCGGCCGATG CCCTCGACGA CGTGCTCGCG GCGCGGGACG CGGGGTTGCC GATGACGGTG GAGACGTGCC CGCACTACCT GACCTTCACG GCCGAGGAGG TCCCCGACGG AGCTACCGTC TTCAAGTGCG CACCTCCCAT CCGGGACAGC AGGAATCTGG ACCGGTTGTG GGACGGGGTG GCCCAGGGCC TGTTCGCCGG GATCGTCACT GATCATTCGC CGTCCACTCC CGCGTTGAAG CAGATCGACG CCGGTGATTT CGCGGCGGCG TGGGGTGGCA TCGCGTCTGT CCAGATCGGG CTACCCGCGG TCTGGACTCA GGCCCGGGCG CGGGGGCACA CCCTCACCGA CGTCGTCGGT TGGATGTGTG CCGGACCGGC CGACCTGGTA GGACTGGCCG GCAAGGGGCG CATCGCGATC GGCGCTGACG CCGATCTGGT GATCTTCGAC GCGGATGCGT CGTTCCTGGT GGAGCCCTCG ATGCTGCGCC ACCGCCACCC GCTCACCCCC TATGCCGGAC GCGTGTTGAA CGGGGTGGTG CTGGCGACCT ATCTGCGGGG GCGGCGAGCG GACGGGGATC GTCCTCCGCG GGGGCGGCTA TTAGAGAGGT AG
|
Protein sequence | MNEPTADVLP ERPQALRSRR VVLPGGERPA AVHVADGRIA AITAADEIPP GTLVTDLGEL ALLPGVVDSH VHINEPGRTE WEGFATATRA AAAGGVTTII DMPLNSVPPT TSLAALAAKR AVAAGQVAVD VGFWGGIIGA DARNLDDLAA LHGAGVFGFK AFLAPSGVEE FPHVSMDVLG AAARRTARMG ALTVVHAEAP SVLALAPPTV GRAFASWLAS RPPAAETEAV AALAALSAAS GARLHVLHLA AADALDDVLA ARDAGLPMTV ETCPHYLTFT AEEVPDGATV FKCAPPIRDS RNLDRLWDGV AQGLFAGIVT DHSPSTPALK QIDAGDFAAA WGGIASVQIG LPAVWTQARA RGHTLTDVVG WMCAGPADLV GLAGKGRIAI GADADLVIFD ADASFLVEPS MLRHRHPLTP YAGRVLNGVV LATYLRGRRA DGDRPPRGRL LER
|
| |