Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_3497 |
Symbol | |
ID | 3905231 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 4170290 |
End bp | 4171909 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637880819 |
Product | Xaa-Pro aminopeptidase |
Protein accession | YP_482579 |
Protein GI | 86742179 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.434008 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.658435 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGCCC AGTCCCACCA AAGCTCACCT CCACCGACCG GCACGGGGGT GCCCGAGAGC GCGCCGGCCG GGCGTGATCC GAACCTCATC ACGACCGACG AGCCGGGCGT TGTCCCCACC GGGGCCAGTG AACCTCCGGC ATCGGATGAT TCCGTGCGCG AACCCGGACA GGCCAACCAC GACGAGGAAC CACACGCGGC GTTCCGGCGA TTTATGTCGA CCGGGTGGGC GCCGATTGAC GACGCTGTGC CATCCCGCGA TGACTGCGCA CCCTACACCA CGAAACGTCG CGCCTTGTTG GGTTCGCGGT TCCCGGCGGA CGCGCTTGTG GTGCCGAGCG GCGGTCTGCG GGTACGGGCA AACGACACAG ATTTTCCGTT CCGTCCCGGA AGTGACTTCT TCTGGCTGAC CGGCTGCCAT GAACCCGATG CCGTGTTGGT TCTGCTACCC AACGCGGCCG GTGAGCATGA TGCGGTGCTC TATGTCGCCG GGCGGTCCGA CCGGTCGAGC TCCGCCTTCT ACACCGACCG GCGATACGGG GAACTGTGGG TCGGTCCCAG ACCGGGGGTT CGGGAGACCA CGGCACAGCT GGAGATCGAA TGCCGGCCGC TCACGGAGCT CGCGGACGCG CTCGCGCGGT TGGCACCGGG AAACACCCGG GTACTACGCG GAGTCGATCC GGTCGTGGAC GGTTCGGTGC TGCGCTGGTC CGGCCGGGGT GGACCAGGAA CCGACCGCGA TCTGGCCCTC GCACAGGCCC TTTCCGAGCT GCGGCTGCTC AAGGACGACT TCGAGATCGC CCGATTGGAC GAGGCGATCG CCGCGACCGT GCGTGGCTTC ACCGAGTGCG TGGGAGAAAT CGGTCGGGCG ATGGACCTGC CCAACGGAGA ACGATGGTTG GAGGGGACGT TCTGGCGGCG GGCCCGGGTG GACGGCAACG ACGTGGGATA CGGTTCCATC GTCGCCTGTG GCCACCACGC CACGACGCTG CACTGGGTCC GCGACGACGG GCCGGTACGC GCCGGTGATC TCGCCCTGCT CGACATGGGG GTCGAGGGAC GGTCCCTCTA CACGGCCGAC GTCACCCGCA CGCTTCCGAT CAACGGACGG TTCACCGAGG TGCAACGCCA GGTCTATGAC GTGGTTCTTC GCGCCCAGCA GGCCGGGATC GACGCGGTCC GGCCGGGAGC GTCCTTCCTG GAGCCGCACC GGGCCGCCAT GCGGGTCATC GCCACGGCGC TCGACGACTG GGGGCTCCTG CCGGTCAGTG CGCAGGAGTC GCTGACCGAG GACCCACGCT CCCCGGGAGC GGGCGTCCAT CGCCGCTATA CCCTGCACTC CACGTCGCAC ATGCTCGGAT TGGACGTGCA CGACTGCGCG CAGGCGCGCG ACCAGACCTA TCGCGACGCG GCGCTGGAAC CGGGCATGGT ATTGACGGTC GAACCGGGCC TGTACTTCCA GCCGGACGAC CTGATGGTTC CGCCGGAACT GCGAGGCATC GGCGTCCGGA TAGAAGACGA CATCCTGGTC ACCAGGGATG GAAGGCGGAA CATGTCCGCC GCACTTCCGC GAACCGCGGA AGACGTCGAA AACTGGATGG CCCGGCAGCT GCCGAACTGA
|
Protein sequence | MSAQSHQSSP PPTGTGVPES APAGRDPNLI TTDEPGVVPT GASEPPASDD SVREPGQANH DEEPHAAFRR FMSTGWAPID DAVPSRDDCA PYTTKRRALL GSRFPADALV VPSGGLRVRA NDTDFPFRPG SDFFWLTGCH EPDAVLVLLP NAAGEHDAVL YVAGRSDRSS SAFYTDRRYG ELWVGPRPGV RETTAQLEIE CRPLTELADA LARLAPGNTR VLRGVDPVVD GSVLRWSGRG GPGTDRDLAL AQALSELRLL KDDFEIARLD EAIAATVRGF TECVGEIGRA MDLPNGERWL EGTFWRRARV DGNDVGYGSI VACGHHATTL HWVRDDGPVR AGDLALLDMG VEGRSLYTAD VTRTLPINGR FTEVQRQVYD VVLRAQQAGI DAVRPGASFL EPHRAAMRVI ATALDDWGLL PVSAQESLTE DPRSPGAGVH RRYTLHSTSH MLGLDVHDCA QARDQTYRDA ALEPGMVLTV EPGLYFQPDD LMVPPELRGI GVRIEDDILV TRDGRRNMSA ALPRTAEDVE NWMARQLPN
|
| |