Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5692 |
Symbol | |
ID | 5674018 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6910941 |
End bp | 6912023 |
Gene Length | 1083 bp |
Protein Length | 360 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641244545 |
Product | NMT1/THI5-like domain-containing protein |
Protein accession | YP_001509948 |
Protein GI | 158317440 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCAACTG GCAGCAGTAC GGGTTTCGAC AGACGCGGGT TCCTCCGCGG GGGAATGGGG ACGACGGCCG GGGTGGCGCT GCTCGCGCTC GGCGGCGGTG GCCTGCTCGC CGCGTGCGGC GATGACAGCA GTGACGGCGG CTCGTCCGGC AGCGGTTTCG GTGTCCTCGA CATCCGCTTT TCCTGGATCA AGAACACCGA GTTCGCCGGT GCCTACATCG CGGATCAGAA GGGCTACTAC AAGGCGGCCG GCTTCTCCGG CGTCAACATG ATCGCCGGCG GCCCGTCCGC GACGCCACAG GACACCGACG TCGCGACCGG CAAGGCGTTC ATCGGCATCT CCGCCCCCGA CATCACCGGC AACGCCATCC TCAACGGCGC ACCGATCAAG ATCATTGGCG CGCAGTACCA GAAGAACCCG TTCGCGATCG TCTCCATGGC GGACAAGCCC ATCGCCGGCC CCCAGGAGAT GATCGGCAAG AAGATTGGTG TGCAGGCGAC CAATGAGAGC GTGTGGACGG CGTTCCTCAA GGCGAACAAG ATCGACCCGA AGTCGATCGA AAAGGTGCCG GTCGAGTTCG ACCCGCTCCC GCTCACCACG GGCACCGTGG ACGGCTGGTT CTCCTTCGTC ACGAACGAAC CGAACCTGCT GCGCGTCAAG GGTTTCGAGG TGACGACCTT CCTGCTCGCC GACCACAACT ACCCGTTGGT CTCCGAGACT TACATGGTCC GTACCGAGTC CATCGACAAG GAACGGGACA AGATCAAGTC CGCGCTCACC GCCGAGATCC GCGGATGGAA GGACTCGCTG GCCGATCCCG CGCTCGGCGC GCACCTGGCG GCGACCGTCT ACGGAAAGGA CCTCGGCCTG GACGAGGAGG AGCAGACCCT GGAGAGCAAG GACCAGAACG CTCTCATCCT CACCGCCGAC ACGAAGGCGA ACGGCCTGTT CACGGTCACC GACAAGCTTG TCGAGGAGAA CATCGACACC CTCGGGATCG CCGGCGTGAG CATCACTGCC GACGAGCTGT TCGACCTGTC CATCATTAAG GAGCTCTACG AGGAGAAGCC TGATCTCGTT TGA
|
Protein sequence | MATGSSTGFD RRGFLRGGMG TTAGVALLAL GGGGLLAACG DDSSDGGSSG SGFGVLDIRF SWIKNTEFAG AYIADQKGYY KAAGFSGVNM IAGGPSATPQ DTDVATGKAF IGISAPDITG NAILNGAPIK IIGAQYQKNP FAIVSMADKP IAGPQEMIGK KIGVQATNES VWTAFLKANK IDPKSIEKVP VEFDPLPLTT GTVDGWFSFV TNEPNLLRVK GFEVTTFLLA DHNYPLVSET YMVRTESIDK ERDKIKSALT AEIRGWKDSL ADPALGAHLA ATVYGKDLGL DEEEQTLESK DQNALILTAD TKANGLFTVT DKLVEENIDT LGIAGVSITA DELFDLSIIK ELYEEKPDLV
|
| |