Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3189 |
Symbol | |
ID | 5671565 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3759609 |
End bp | 3760916 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641242083 |
Product | hypothetical protein |
Protein accession | YP_001507503 |
Protein GI | 158314995 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTCGC ACCATCGCTG GCGCCGTCCG CTCATGCTGA TCGCCGCGGG AGCCGCACTC GTCTTAGCCG GTTGCAGCGT CGAGTCGGAC GACCCGCCGG CCAGCAGCCC CACCTCGGAC CCGTTCCCGG CCCCGCCGGC GCCGGCGATC TCGCCGGGTG TCACGGCCGA CAGCATCAAG ATCGGTTTCG TCTACCCCGA CCTCGAGGTC GTCAAGCAGT ACGTCGACAT CGACCACGGC GACTACCAGG CCACCTTCCA AGCCCTGGTC GACAAGGTCA ACGCCACGGG CGGCATCAAC AGCCGGAAGA TCATCCCGGT GTACGGCGCG GTCGACGTCA TCTCCCCCGC CGGCGCGCAG GAGACCTGCG TCAAGCTGAC CCAGGACGAG AAGGTCTTCG CGGTGCTCGG CAGCCTCAAC GCCGAGGACG CGCTGTGCTA CGTCCAGACC CACAAGACGG CGCTCGTCGG CGGTGACCTC ACCACGCAGC GCTACGCCAA GGCGCAGGCA CCGTGGGTCT CCGACCTGCG CGGCGGCGAC GAACTCGCCG ACGGCATCGA GCTGTTCACC GCCGACAACA CCCTGGCGGG CAAGAAGCTC GCGGTCGTCG CCTACCGGGA CGACCAGGCG ACGCTGGACA AGGTGGTCCT GCCGGCCCTG CAACGTCTCC AGGTCCCGGT GACCGAGACA GGCATCCTCG ACGCCGACAT CAGTGACGCG GCCGCGGTCT CCCAGCAGTT CAACGTCTTC ATCCAGAAGT TCCGGGCCGC GGGCGTCGAC ACCGTGCTCC TGGTCGGCGG CTCCCAGCTC CAGTTCCCCG CCGAACTCCA GAAGACCGAC TACCGCCCCA GGCTGATGTT CGCGACCACC AGCCAGGCCG GGGCCTACCT GGCGAGCCCG GGCGACCACG ACCCGGCCAT CATGGCCGGC GCCACCGCCC TCGGGCTCGT CGTCGACTTC ACCGAGCCGG CCAACGCCGC GTGCATCGCC ACCCTTGAGG CCGCGCTGCC AGCACTCACC GGCAAGCTGG TCGACCCGGC GACCGTGCCC TCAGGCCAGC CCACCCCCGG AACATCCGAA AGCGCCGCCT GCCGCTACCT GACCCTGTTC CAGGCCATCG CCGAGAAGGC CGGCAAGGAC CTCACCTACC AGTCCTTCCA GCAGGCCGCG TTCTCCCTCG GCTCCTTCCA GGTCCCCACC TACCGGGACA AGGCCACCTA CAGCCGCGAG ACACCCCACG GCGCCGTCCC CCCGCGCCTG TTCACCTTCG ATCCCGCGAA GAAGAACTTC TTCCCCGCCG CGGGCTGA
|
Protein sequence | MTSHHRWRRP LMLIAAGAAL VLAGCSVESD DPPASSPTSD PFPAPPAPAI SPGVTADSIK IGFVYPDLEV VKQYVDIDHG DYQATFQALV DKVNATGGIN SRKIIPVYGA VDVISPAGAQ ETCVKLTQDE KVFAVLGSLN AEDALCYVQT HKTALVGGDL TTQRYAKAQA PWVSDLRGGD ELADGIELFT ADNTLAGKKL AVVAYRDDQA TLDKVVLPAL QRLQVPVTET GILDADISDA AAVSQQFNVF IQKFRAAGVD TVLLVGGSQL QFPAELQKTD YRPRLMFATT SQAGAYLASP GDHDPAIMAG ATALGLVVDF TEPANAACIA TLEAALPALT GKLVDPATVP SGQPTPGTSE SAACRYLTLF QAIAEKAGKD LTYQSFQQAA FSLGSFQVPT YRDKATYSRE TPHGAVPPRL FTFDPAKKNF FPAAG
|
| |