Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3271 |
Symbol | |
ID | 5671645 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3875115 |
End bp | 3876323 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641242163 |
Product | putative Leu/Ile/Val-binding lipoprotein transmembrane |
Protein accession | YP_001507583 |
Protein GI | 158315075 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGATTAG GGCGACGGAA AAGGCTGGTG GCCACGACGG CCGCGATTCT TGCGGCTACC GCGACCAGTT GTTCGAACTC CGGTTCCGGA AGCGATCCTA TTGTCGCCTG CGAGAGCCCT GGTGTCACCT CTGACCAGGT AAAGTTCGGC CTGGTCTTCT CCGACTCGGG AGCCGGAAAC CAGACACTGT CCTCGGCTCG TGCCGGGGTC GACGCCAGGA TCGGCCTGGC GAACCAGGAA GGTGGAGTCA ACGGCCGTCG CCTCGTCTAC GAATGGCGCG ACGACGCGGC CTCCCCATCG CAGAACGCCA AGGTGGTCGA CGATCTCGTC AACCAGGAAT CGGTGTTCGG GGTCGTCGCT GTCACCACGT CGACCAGCGG TTCTATCGAG AACCTGGGAT CGGCGGGCAT CCCGGTCGTG GGTCTCGCCG ACGCCACCTG GAAGACCCAC CCGAACATGT TCTCGAATTC CTACGAGACG TCACCGCAGA GCGTAGGCCA ATATCTCCAG GCCAACGGCG GAACAAAGGT CGCTTTTGTC ACCACAGGCT CGTCAGCCTA CACAGTGGGC TATGCCGAAC AGTATGCGTC GGCCATGCGG GCGATGGGGC TGACCGTGGT GGGAACGGCG TCGTACTCAA GCGGCGACAG CCCGGTTCGG GTGGCCCAAC AGCTGGCCGA CTCCGGAGCC AACGTCATCG TGGGCCTCAC CACCCCGAAC GACATCGCCA GCATCATGCA TGCGGCTCGC ACCATAAACG CCAGTTTCGC CGCCACCGTC TCGCTCGCCG GGTACGACCG CGGTGTGCTG AACACGCTGG GGACGGACCT GGCTGGCGTC TCGTTCCCGG TGTACTTCCG CCCCTTCGAG GCCGGTGGAC CGGCCATCGA CCACTATCGA AACGCGATCA CACAGTTCGC TCCGGAACTG GTCATGCCCG AGCAGCAGTT CGCGATGTAC GGGTACATCT ACGCGGACCT GTTCATCCGG GGGCTCCAGG AGGCCGGCGC CTGCCCCACC CGGGAGAACT TCATCAGCGG GCTACGCCCC GTGACCGGCT ACAACGCGGG TGGTCTGATC GAACCGGTCG ACCTCGCCAC CAACATCAAC AAGCCGCTTG ACTGCAGCGC ATTCGTCCAG GTCGACCCAA CGGGCCGCAC CTTCCAGGTC ACCCAGGAAC GCCTCTGCGC CGACGGTACG GGAAGCTAA
|
Protein sequence | MRLGRRKRLV ATTAAILAAT ATSCSNSGSG SDPIVACESP GVTSDQVKFG LVFSDSGAGN QTLSSARAGV DARIGLANQE GGVNGRRLVY EWRDDAASPS QNAKVVDDLV NQESVFGVVA VTTSTSGSIE NLGSAGIPVV GLADATWKTH PNMFSNSYET SPQSVGQYLQ ANGGTKVAFV TTGSSAYTVG YAEQYASAMR AMGLTVVGTA SYSSGDSPVR VAQQLADSGA NVIVGLTTPN DIASIMHAAR TINASFAATV SLAGYDRGVL NTLGTDLAGV SFPVYFRPFE AGGPAIDHYR NAITQFAPEL VMPEQQFAMY GYIYADLFIR GLQEAGACPT RENFISGLRP VTGYNAGGLI EPVDLATNIN KPLDCSAFVQ VDPTGRTFQV TQERLCADGT GS
|
| |