Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_3133 |
Symbol | |
ID | 3903930 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 3704067 |
End bp | 3705662 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637880454 |
Product | leucyl aminopeptidase |
Protein accession | YP_482219 |
Protein GI | 86741819 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0260] Leucyl aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.100377 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACCA TCGTGCCCAG TACCGCCGCT CTCGCCGATC TCGATGTGGA CGCGGTGGTG ATCGCCACCG CCACCGGCGA CGAGGGGCTC CTCGTAGCCG GTGGCGCGGC GGACCTGGAC GCCGCGCTCG GCGGCAGGCT GACGCAGGTG CTTGCCTCGC TGGGCGCGAC CGGCAAGGCC GGCGAGACGG TCCGCTTCGC CACCCTCGGT ACCGTGCCCT GCGCGACGGT TCTCGCGGTC GGGCTGGGCC CCCTCGCCAC GTCGTCCACG CCGATGTCGT CCACGCCGAT CGGCACCGAG GCACTACGCC GGGCGGCCGG GGTCGCGGTG CGCTCCCTCG CCGGGACCGC GCGGGTGGCC GTCGCTCTGG CGGCCGCGCC GGGCGCGGTC ACGTCGGAGT CGGTGCGTGC CGTGGCCGAA GGCGCGTTGC TCGGCACCTA CTCCTACGAC GGGCTGCGCA CCACGTCGGC GAACGGCCGC CCCCGCCCCG TCGAGGAGCT GACGGTGCTC GTCGACGAGC AGAGCCTCCC GACAGCGGAA GAGGAACTGC GCCGGGCCAC CGTCGTCACG GATGCGGTCA CCCTGGTCCG CGATCTGGTG AACACGCCGC CGAGCCATCT GTCCCCCGCC CTGCTCGCCG ACATCGCCGT CGAGCGGGCC ACCGCCGCCG GCGTGACGGT CGAGGTCCTC GACGAGGTGA CGCTCGCCGA GGGCGGCTTC GGAGGCCTGC TCGGTGTCGG CCAGGGATCC GCGAATCCAC CGCGGCTGGT GCGGCTGGAA TGGCGCGGCG GTGGATCCGA GCGGCCGGCG CTGGCGCTGG TCGGCAAGGG CATCACCTTC GATTCCGGGG GGTTGTCCCT GAAGCCGTCG ACCTCCATGG AATGGATGAA GACCGACATG GCGGGCGCGG CCGCGGTGCT CGGCACCGTC ATCGCCGCGG CCCGGCTCAA GCTCCCGATC ACTCTCACCG GCTGGATGCC CTGCGCGGAG AACATGCCCT CGGGCGACGC GATCCGCCCC TCCGACGTTC TCACCCTGCG CGGCGGCACC CGGGTGGAGG TGCTCAACAC CGACGCCGAG GGCCGGCTGG TGCTCGCCGA CGCCCTCGTC CGCGCCAACG AGGAGTCACC GGACCTGATC GTGGACGTGG CCACCCTGAC CGGCGCACAG ATCGTCGCCC TCGGCCACCG GACGAGCGGA CTGATGGGTC GTGCGGCCGC GGTGGACGCG GTCGCGTCGG CGGCGGCCGC CGCCGACGAG AGCGTCTGGC CGATGCCCCT TCCGCCCGAT CTGCGCAAGG GACTGGACTC GACGGTGGCC GACATCGCCA ACGTGCCACC GGGAGGCAAC CGAGACGGCG GCATGCTGGT CGCCGCGCAC TTCCTCGCCT CGTTCGTCCC CGAGAAGGTG TCCTGGGCGC ACATCGACAT CGCCGGTCCC TCCTGGAACG GTGGAGAGCC CTACGGCCAC ACCCCCAAGG GAGGCACCGG CATGATCGTA CGCACGCTCG TCCAGCTGGC GCAGGACCGC GCGACGACGA CGGCCTCCCC CGGTTCGGCC ACCGGTTCCC CGGCTGAGTC CCCGGCCGAG GGCTGA
|
Protein sequence | MTTIVPSTAA LADLDVDAVV IATATGDEGL LVAGGAADLD AALGGRLTQV LASLGATGKA GETVRFATLG TVPCATVLAV GLGPLATSST PMSSTPIGTE ALRRAAGVAV RSLAGTARVA VALAAAPGAV TSESVRAVAE GALLGTYSYD GLRTTSANGR PRPVEELTVL VDEQSLPTAE EELRRATVVT DAVTLVRDLV NTPPSHLSPA LLADIAVERA TAAGVTVEVL DEVTLAEGGF GGLLGVGQGS ANPPRLVRLE WRGGGSERPA LALVGKGITF DSGGLSLKPS TSMEWMKTDM AGAAAVLGTV IAAARLKLPI TLTGWMPCAE NMPSGDAIRP SDVLTLRGGT RVEVLNTDAE GRLVLADALV RANEESPDLI VDVATLTGAQ IVALGHRTSG LMGRAAAVDA VASAAAAADE SVWPMPLPPD LRKGLDSTVA DIANVPPGGN RDGGMLVAAH FLASFVPEKV SWAHIDIAGP SWNGGEPYGH TPKGGTGMIV RTLVQLAQDR ATTTASPGSA TGSPAESPAE G
|
| |