Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3124 |
Symbol | |
ID | 5671502 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3679244 |
End bp | 3680332 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 641242021 |
Product | NMT1/THI5-like domain-containing protein |
Protein accession | YP_001507441 |
Protein GI | 158314933 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.45434 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGCACC GCAAGAAGAT CGCGTTAGCG GCGTTAGTCG CCGTGTCCGT CATGATGCTC GCTGCCTGCG GGTCGGGCAC TGGATCCGAG GCGGACGGCG AGTCGAAATC GATAACTTTC ATGCTGCCCA CCCCGTCGTG GGACGTGTCC CTAGCGGCTT TCGCCGTGGC GCAGGCGAAG GGGTACTTCG CAGAGGAGAA CCTCAACGTG AAGTACGTCC TGACCAAAAG CAGCCAGCTC GCGGCCACGA CCGTCGCTCA GGACACCGAC TCCGTCGGGC TGGTCAGCCC AGAGCCCGTT ATGATCGCAG CTCAAACAGG TAAAGGTCTG GGGCTGGAGT ACTTCTATAA CTTCTTCCGT CGGCCGATCT ACAATCTGGC GGTCCTAGCA GACAGCGAGA TAAAGGACCT CCACGGCCTC CAGGGCAGGA AGGTCGGCGT TCAGAGCCTA TCCGCGGTTG GCGTCTACTA CGTCAAGGCC TACCTAGCCG AGGCCGGGCT CGCCCCTGAC ACCGTGACGC TCATCCCGAC CGGTAGTGGG ACGCAGGCAC TCACGGCACT GGAGGGCAAG CAGGTCGATG CCATGCTGGT CAACGACGTC TGGCCGGCGC AGTGGAGGAA CGCCGGGGTC GAGACCCGCA GCATCTCGAC GACGAGCCAA CTGTCGGTCG TCCAGCACGG GCTGCTGACG AGAGCGGAGA ATCTGAGGAA TAACCCTGAC ACGTATGCGG CGCTCGGGCG CGCCGTCGCC AAGGCTACGT TGTTCACCCT CGAAAACCCC GAGGCGGCCA TCACCATGAT GTGGAAGGCA CAGCCCGAGA CGAAGCCGAC CGGAATCGAT GACACGGAGG CCATGAGGCA GAGTCTCATG ATTCTCAACG CCCGGATCCC GAACCTGGAG CTTGGTGCCG GCGAGACGAT GTGGGGCCAG TACCCGGACC GTGCCTTCGC CGACTCCGTC AAGTTCGCGA CCGACAGCGG CCTGATCACC AAGGATATCG ACCCGAACGT CTTGTCCACA AACGACCTAG TGGTGAAGAT CAACGACTTC GATGCGGCTG CGGTCACGGC TGACGCGGAC AGCAGTTGA
|
Protein sequence | MRHRKKIALA ALVAVSVMML AACGSGTGSE ADGESKSITF MLPTPSWDVS LAAFAVAQAK GYFAEENLNV KYVLTKSSQL AATTVAQDTD SVGLVSPEPV MIAAQTGKGL GLEYFYNFFR RPIYNLAVLA DSEIKDLHGL QGRKVGVQSL SAVGVYYVKA YLAEAGLAPD TVTLIPTGSG TQALTALEGK QVDAMLVNDV WPAQWRNAGV ETRSISTTSQ LSVVQHGLLT RAENLRNNPD TYAALGRAVA KATLFTLENP EAAITMMWKA QPETKPTGID DTEAMRQSLM ILNARIPNLE LGAGETMWGQ YPDRAFADSV KFATDSGLIT KDIDPNVLST NDLVVKINDF DAAAVTADAD SS
|
| |