Gene Information |
Locus tag | Francci3_0835 |
Symbol | |
ID | 3905112 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 974379 |
End bp | 976265 |
Gene Length | 1887 bp |
Protein Length | 628 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 637878168 |
Product | amidohydrolase |
Protein accession | YP_479948 |
Protein GI | 86739548 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG1001] Adenine deaminase |
TIGRFAM ID | [TIGR01178] adenine deaminase |
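
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 974379
END_BP = 976265
GENE_LENGTH_BP = 1887
PROTEIN_LENGTH_AA = 628


def span_length(start, end):
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def coding_length_aa(gene_length_bp):
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))     # 1887
print(coding_length_aa(GENE_LENGTH_BP))  # 628
```

Under those assumptions both results agree with the Gene Length and Protein Length fields.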
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGGGGTGA TTTCGCCGTT GGTCATACGT GATCCCGTTC CGGGCGTGCC GCCATCGGAT GCAGACTCGG TGCTGACCGG CAGACGAGAG GCCGCCATGA ACGCCGAACG CTCCAGCCGG GGACCCCGAC TGATGACGGT GTCCGGCGGA GCGTCCCAGC CGACCGTCTC CTCTGCCACG CCCCACCGAT CGGCGACCCC GGCCCGCACG GCGGCGTCGG GCACGACCTC CACCACCGCT CCCGCGGTCG ACCCGCGTGA TCCGGCGGCG GCTCCTGGTC CGACCGGGCA GGCACCCGTG GTCGGCGCGG ACCTGCTGCG TCCGTCCGAG GCACAGCTCG CTCGACTGCG TCGGGTGGCC GCGGGTGAGG AGGAAGCCGA CCTCGTCATA CGCGGTGGGC TTGTCGTCGC CGTCCACAGC GGTGCCGTGG CCGCGCGGGA CATCCTGATC GTGGGGTGCT ACATCGCCGC GGTCACGAAG CCGGGAACGT TGGCGGGTCG GCGCTCGCTC GACGCCGCCG GCAGATTCGT CCTGCCGGCC TATGTCGACG CCGGCCTGCG TGTCGAGGAG ACGCTGCTCA CTCCGGGGGA ACTGGCTCGC CTGATCGTCC CGCGGGGAAC CGTCACCCTG GTGACCGACC CGGCCGTCCT GGTCGCGCTG GGCGGTCTGC GCGGTGTGGA TCTGGTGACG GGATCCTCGA CCCCGTTGCG GGTGCTGGTC CGAGCGGGGC AGGTGCCCGG GAGGCAGGTC CCCGGGACCG GGGCGCCCGC AGCCCCGGTC GTCGAGGTGC CTCCGATACC GCCCGCCTCC GCGCTGACCG CGCTGTCCTC GGTAGGGGCG GCGCCCGTCA AGTCGTCCGC CACGGAGGCC GGCGGTCTGC TGTGGGCGGC CGGCAGCGGG GCCGAGCCGG TCCGGGGCAG GGCCAGATCG GTCGCGGAGC TTGCCACGGT CGGGCATCTC GATCACGACG TGCGACTCGC GGTCGGTCGG GGGATGGGCC AGATCGACGC CATCCGGCGG TTCTCCCTGC TCCCCGCCCG CAGGCACGAT CTGGAACCCA CCCTGGGGTC AATCGCGCCC ACCCGCTTCG CTGATCTCCA GGTCGTCTCC TCCCTGGCCG GAACCGCGCC GCCCGACGTG GTGGTGGCGG GTGGCCGGAT CGCCGCGGAA TGCGGCCGAC CCCTGTTCGA CAACCTCGAC ATCTCGCCCG CCTGGGCCAC CAGCCGGACA CGACTCCCGG CGAACCTGCA CGCCGGCTCC TTCACATCAC TCGGCCTGCG ACGGTCCCAC CGGCAGGATG CCAGTGTGGT CGTGGTGAGC GTCGATCCCC CCCGCGACCC GGGGACGGCG CCGCCCGTCG GTGGACCAGC GCGGGCGTTG CGGATGACCG GGACGGGCCA GGTCTCCGGC CTCTCCCACC CGCGTGGACT CCGTACCGTG CGGGTGGAAC CCACCCTGCG GGACGGCTGG GCGGTAGCCG ATCCGTCTCG GGACCTTCTC AAGATCGCAA TTTTCGGTCG GGACGGCTCC TCGGACGAGA TCGACGTCGG GCTGCTCCGC GGGTGCGGGC TGACCCGGGG CGCGCTCGCG GTCACGACGG CGGAGCCGCC GGGGCATCTG ATCGTCGTCG GGGCGCGGGA CGACGACATG GTGACCGCCG CTCGGGCCCT CGAGGGCATG GGCGGGGGCT ACGTGGTGGT CGACCAGGGC TGGGTGCGGG CGGCGTGCGC GCTGCCCCTG CTCGGGGTGA TGAGTGACGC ACCCTGGGAG GCGGTGCTGG GAGAGCTGGC CGCGGTGGAC 
ACGGCCGCAG CCGATCTCGG CTGCCGGCTG CCGTTCCCGC TGCGCACGCT GGCCGGATGG GGCTGCGCCC TCTACACCCG GCCCTGA
Protein sequence | MGVISPLVIR DPVPGVPPSD ADSVLTGRRE AAMNAERSSR GPRLMTVSGG ASQPTVSSAT PHRSATPART AASGTTSTTA PAVDPRDPAA APGPTGQAPV VGADLLRPSE AQLARLRRVA AGEEEADLVI RGGLVVAVHS GAVAARDILI VGCYIAAVTK PGTLAGRRSL DAAGRFVLPA YVDAGLRVEE TLLTPGELAR LIVPRGTVTL VTDPAVLVAL GGLRGVDLVT GSSTPLRVLV RAGQVPGRQV PGTGAPAAPV VEVPPIPPAS ALTALSSVGA APVKSSATEA GGLLWAAGSG AEPVRGRARS VAELATVGHL DHDVRLAVGR GMGQIDAIRR FSLLPARRHD LEPTLGSIAP TRFADLQVVS SLAGTAPPDV VVAGGRIAAE CGRPLFDNLD ISPAWATSRT RLPANLHAGS FTSLGLRRSH RQDASVVVVS VDPPRDPGTA PPVGGPARAL RMTGTGQVSG LSHPRGLRTV RVEPTLRDGW AVADPSRDLL KIAIFGRDGS SDEIDVGLLR GCGLTRGALA VTTAEPPGHL IVVGARDDDM VTAARALEGM GGGYVVVDQG WVRAACALPL LGVMSDAPWE AVLGELAAVD TAAADLGCRL PFPLRTLAGW GCALYTRP
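
The relationship between the two sequences can be spot-checked by translating the ends of the gene sequence. This is a minimal sketch, not the pipeline that produced the record: it builds the codon table from the usual TCAG ordering (translation table 11 assigns the same amino acids as the standard code and differs only in permitted start codons), and `translate` is a hypothetical helper that does not special-case alternative starts such as GTG or TTG.

```python
from itertools import product

# Codon table in NCBI ordering: first base varies slowest, third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string; whitespace between blocks is ignored."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 and last 21 nucleotides of the gene sequence in this record.
head = "ATGGGGGTGA TTTCGCCGTT GGTCATACGT"
tail = "GCCC TCTACACCCG GCCCTGA"

print(translate(head))  # MGVISPLVIR
print(translate(tail))  # ALYTRP*
```

Both fragments match the start and end of the listed protein sequence, with `*` marking the TGA stop codon.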