Gene Information |
Locus tag | Francci3_3118 |
Symbol | |
ID | 3904244 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 3690085 |
End bp | 3691395 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637880439 |
Product | aminotransferase, class V |
Protein accession | YP_482204 |
Protein GI | 86741804 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes |
TIGRFAM ID | |
|
|
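The length fields above are consistent with the coordinates. A minimal sketch of that arithmetic follows, assuming Start bp and End bp are 1-based inclusive positions and that the CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon (assumed present).
    return gene_len_bp // 3 - 1


length_bp = gene_length(3690085, 3691395)
print(length_bp)                  # 1311
print(protein_length(length_bp))  # 436
```

Both values match the Gene Length and Protein Length rows of this record.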
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.646431 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCCCC GCCGATCCAA CCGGACGACA TGCTGTCTTC GACACGCTGT AGATGGTGCG GCAGCAGCGT ATCTCAGCAG CCGCGAACCT TCTCGCCTAA GGCCCCCGTC AGCGCCTGTG ACGCACGCCC CTCCCGGCCG TTCACGGCGT ACCGTTTCAC AGGTGCCGGC CTATCTCGAC CACGCGTCGA CCACGCCGCT GCACCCTGCC GCGCGCGAGG CTCTGCTAAT GGCTCTTCAG GATGGTTGGG CCGACCCGGC ACGGTTGTAT CGGGAGGGCC GCCGGGCGCG CATGCTGCTC GACGCGGCCC GCGAGACCGT CGCGGGCGTG CTCGGGGCCC GGCCGGACGA GATCAGTTTT CCCGCGAGCG GGTCGGCGGC GGCGCATCTG GCCCTGTTGG GCACGGCGGC GGCCCGGCGG CGTGCGGGTG ACGTCGTCAT GGTCAGCGCG GTCGAGCACT CCAGCGTCCT GCACGCCGCG CAGCGGCACG AACAGGCCGG TGGACGGGTC GTCAGGATTG GTGTGGATCA TCTCGGCCGG GTCGACCCCG CCGACTTCAC CCCCGTCGCC GGTACCGCCG TCGCCAGCCT CCAGCACGCC AACCACGAGG TCGGGACCAT CCAGCCGGTC GCCGAGGTCG CCGAACGGAT GCGCGCCGCC GGGGTGCCGC TGCACACCGA CGCCGCCGTG ACAGTCGGCC ACATCCCCGT CGACCTGGCC GACCTCGGGG TGGATCTGCT CACCGCGAGT GCCCACAAGT TCGGCGGACC ACCCGGGGTG GGCGTTCTCG CGGTGCGCAC CGGGACCCGC TGGCGCAGTC CCGGTCCCGT TGACGAGCGG GAGGGCGGTC GGGTTGCGGG CTATCCGAAC GTCCCCGCCG TCGTCGCCGC AGCCATGGCG CTGAGCGCCC GGGCGGGTGA ACTCGCCGCG GAGGCGCCCC GGCTCGCCGG CTACGTGGCC GAACTGCGCC GCCGGCTGCC CGAACTCGTC AACGGCGTGG AACTACTCGG CGATCCGGAC CGGGCGGCGA CGGTTCCGCA CATCAGCGCG TTCTCCTGCC TCTACGTCGA GGGCGAGGCG CTGCTGACCG AGCTCGACCG GACCGGAATC GCCGTGAGCT CCGGGTCGAG CTGCACATCC GACACCCTGA TCCCGAGCCA TGTCCTGGTC GCCATGGGCG CGTTGACGCA CGGCAACCTG CGGATATCCT TTGGCCGCGA GTCGACCCAG GCCGATCTCG ACGCCCTGCT GACGGCGCTG CCCGCAGCCG TACGCGCCGT GCGGGACCGG CTGGGCGCCG CCGGGCTTTG A
|
Protein sequence | MSPRRSNRTT CCLRHAVDGA AAAYLSSREP SRLRPPSAPV THAPPGRSRR TVSQVPAYLD HASTTPLHPA AREALLMALQ DGWADPARLY REGRRARMLL DAARETVAGV LGARPDEISF PASGSAAAHL ALLGTAAARR RAGDVVMVSA VEHSSVLHAA QRHEQAGGRV VRIGVDHLGR VDPADFTPVA GTAVASLQHA NHEVGTIQPV AEVAERMRAA GVPLHTDAAV TVGHIPVDLA DLGVDLLTAS AHKFGGPPGV GVLAVRTGTR WRSPGPVDER EGGRVAGYPN VPAVVAAAMA LSARAGELAA EAPRLAGYVA ELRRRLPELV NGVELLGDPD RAATVPHISA FSCLYVEGEA LLTELDRTGI AVSSGSSCTS DTLIPSHVLV AMGALTHGNL RISFGRESTQ ADLDALLTAL PAAVRAVRDR LGAAGL
|
| |
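The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping and compute a GC percentage of the kind reported in the GC content row. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the short input used for the GC calculation is a toy string, not this gene.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace used to group residues into blocks and uppercase."""
    return "".join(grouped.split()).upper()


def gc_percent(grouped: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(grouped)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence, as printed in the record.
print(clean_sequence("ATGTCGCCCC GCCGATCCAA"))  # ATGTCGCCCCGCCGATCCAA

# Toy input: 6 of 8 bases are G or C.
print(gc_percent("ATGC GCGC"))  # 75.0
```

Passing the full Gene sequence value to `gc_percent` would give a figure to compare against the reported GC content, which the record appears to round to a whole percent.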