Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2952 |
Symbol | |
ID | 5671338 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3473854 |
End bp | 3476646 |
Gene Length | 2793 bp |
Protein Length | 930 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641241858 |
Product | ABC transporter related |
Protein accession | YP_001507278 |
Protein GI | 158314770 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0411] ABC-type branched-chain amino acid transport systems, ATPase component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGACG TTCTTCCGTT CTTAGTCGCC GGTCTGACGA CCGGCGCGGT CTACGGGCTT GCCGGTACCG GATTGGTGTT GACCTATAAG ACGTCGGGCG TCTTCAACTT CGCCCATGGC GCGGTCGCTG CGCTGGCGGC GTACGTGTTC TATTCGCTGT GGGTAAGTCA GGGCTGGCCG TGGCCGGTGG CGGTGGCGGT GGCGGTAGTG GCCGCGGGTC CGGTGCTGGG CCTGGTCCAG GAGCTTCTCG CGCGTTTCAT CCAGGGCTCT AGTCTGGCCT TGCAGGTGGC GGCCACAGTG GGGCTGTTGC TCGTAGTCCA GGCGGTACTG GTGCTGGTCT ACGGGACGTT GGAGACGCGG ACGGTTCCGA TCTTCCTCGG CTCCGGGAAT GTGCGGCTGG CCGGGACGAA TGTTCGCTGG GCGGACATAG CAACATTCGC GTTTGCCGTG GTGGTGACGG CGGGGCTGTC CGTGATGTTC CGCCGAGCCC GGCTGGGGTT GGCGATGCGC GCCGTGGTGG ACGATCCAGC GTTGCTGGAT CTGGCTGGAA CGAGCCCGCG GCAGACGCGC CGGTTCGCGT GGTGCATTGG GGCGACTCTG GCGGCGGCGT CGGGGGTGCT GTTCGCGCCG TTGCTGCCTT TGGACGTGCT ACAGCTGACA CTGCTGGTGG TGGCCGCCTT CGGTGCGGCG GCGATCGGCG CGTTCACGAG CCTGCCGCTG ACGTTCGCCG GGGGGCTGGT GATCGGGGTG CTCGCCTCGC TGGCCACGAG GTACTTCACG ACGGGGCTGC TGGCCGGGCT ACCGCCCGCG CTGCCGTTCG CGGTGCTGTT CCTGGTGCTT CTCGTCTTCC CCCGGCGGTA TCTGTCCGGG CCGGTACGGG TCGTGCCGCG CAGCCGTCCG GCCTGGGCCG CGCCGGCGCC ATTGCAGCTG CTGGCCGGGG TCGGACTGCT GGTCGCGTTG GGGCTCGTGC CATCGTTCGC CGGCATTCAC CTCACCGACT GGACGGCGGC GCTGGGCACG GCGATTGTGT TCCTGTCCTT GGGGTTGCTG GTTCGTACAT CCGGGCAGGT CTCGCTGTGC CATGTGAGTT TCATGGCGAT CGGCGCCGCG GCCTTCTCCC ACCTTGCTGT CGGACACGAC CTGCCCTGGC TCGTCGCGTT GCTAGTCACA GGGCTGATCG CCGTACCAGT CGGGGCGATG CTGGCTGTCC CGGCGGTCCG CTTGAGCGGC CTATATCTTG CGTTGGCGAC GTTCGGGTTC GGGATCTTCC TACAGTACAT GCTCTATACC CAGGACTACA TGTTCGGCTC GATGGGCGCG GGATTGAGCG AGCCGCGGCC GCACCTGTCG TGGCTGAACG TGGAGGATGA CAAGGGCTTC TACTACCTGG TCCTCGTCAT GGCCGTCGCG GCCGTCACCC TCCTAGTAAT CTTGGACCGG AGCCGGCTGG GCAGGCTGTT GCGGGGACTG GCGGAGTCGC CGACCGCGTG GGAGACCTCC GGGGTGACAG TGAGCGTCAC CCGCGTGCTG GTGTTCTGTG TTTCGGCGTT CATGGCGGCG GTCGGTGGAG CGCTGACCGC GGTCGCGCAG AGCACCGTGT CCGCCGACGC GTACCAGCCG ATGCTGTCGC TGACCTATTT CGCGGTCGTC ATCGTCGCGC TGGGCGGCAA CCCCTGGTAC GCGCTGACCG GGGCGGGTGC CCTCGAACTG ATCCCGTCCT ATATCTCTGG CGAGAACACC GCCGCGGTCC TGCAACTGGG TTTCGGTCTG GCAGCGGTGG GCGTTGCGTT AGCTCCCCCA ACGGCGAGCC TGCTCCCCGT CGCCGTGCGG CGTCGCATAG ACGCGGTGTC CCGCCGTCCG CGGCCGTCAG GGGCACGGGC ACCGGAAGGT GCCGCGGCGG TCCCAGCGGC CGCTGCATCG TGGCCGGGGA ACGGGGCGGC GGAAAGACAG GTGGCTGCTG GCGCCCTAGA GGTGCAGGAC CTGAAAGTCC GGTTCGGCGG CGTGCTCGCG GTGGACGGTC TCAGCCTGGC CGCGCCGACC GGCCGGATCA CGGGTCTCAT CGGCCCGAAC GGCGCCGGCA AGACCACAAC GTTTAACGCA TGCTCGGGCC TGGTGCGTCC AAACAGCGGG CGCGTGTTGC TCGCCGGGCA CGACGTCTCT CGCGCCGGAC CGGCGGCCCG GGCCCGGCGC GGGCTGGGCC GAACCTTCCA ACGGATGGAG CTGTTCGACT CGCTGCCAGT CCGGGACAAC GTGGCCGCCG GCGCGGAAGG CGCGCTCGCC GGCGGCAACC CGCTGACGCA CTTGGCCAGC CGGCCGGAGG ACCGCGTGCA GGTCCGCCGC GCCACCGACG ATGCACTGGA GATGTGCGAC CTGACCGCCC TTGCCGACAC GGTGACGGCG AAGCTGTCAA CCGGGCAGCG GCGGCTGGTG GAGCTGGCCC GCTGCCTCGC GGGCCCGTTC CAGATCCTGC TGCTCGATGA GCCGTCTTCA GGGCTGGACC GGGTGGAGAC CGTCCGGTTC GGGGAGATCT TGCGCCGCGT CGTCGCAGAG CGCGGCGTTG GGATCCTGCT CGTCGAACAC GACATGGCAC TTGTCCTCGA CATCTGCGAG ATGATCTACG TTCTCGACTT CGGTCGACTC GTGTTCGCCG GGAGCCCGGG CGAGGTCGTC GCATCTTCGG TCGTGCAGGC CGCCTACCTC GGCGACACGG CTGTCGAAGC CGCCGTCCCA CAGACGGCGC AAGGTGAGGC AACCGTGAGC GGGCTCGTGT CGGATGAGGA GGTCGTGGCA TGA
|
Protein sequence | MSDVLPFLVA GLTTGAVYGL AGTGLVLTYK TSGVFNFAHG AVAALAAYVF YSLWVSQGWP WPVAVAVAVV AAGPVLGLVQ ELLARFIQGS SLALQVAATV GLLLVVQAVL VLVYGTLETR TVPIFLGSGN VRLAGTNVRW ADIATFAFAV VVTAGLSVMF RRARLGLAMR AVVDDPALLD LAGTSPRQTR RFAWCIGATL AAASGVLFAP LLPLDVLQLT LLVVAAFGAA AIGAFTSLPL TFAGGLVIGV LASLATRYFT TGLLAGLPPA LPFAVLFLVL LVFPRRYLSG PVRVVPRSRP AWAAPAPLQL LAGVGLLVAL GLVPSFAGIH LTDWTAALGT AIVFLSLGLL VRTSGQVSLC HVSFMAIGAA AFSHLAVGHD LPWLVALLVT GLIAVPVGAM LAVPAVRLSG LYLALATFGF GIFLQYMLYT QDYMFGSMGA GLSEPRPHLS WLNVEDDKGF YYLVLVMAVA AVTLLVILDR SRLGRLLRGL AESPTAWETS GVTVSVTRVL VFCVSAFMAA VGGALTAVAQ STVSADAYQP MLSLTYFAVV IVALGGNPWY ALTGAGALEL IPSYISGENT AAVLQLGFGL AAVGVALAPP TASLLPVAVR RRIDAVSRRP RPSGARAPEG AAAVPAAAAS WPGNGAAERQ VAAGALEVQD LKVRFGGVLA VDGLSLAAPT GRITGLIGPN GAGKTTTFNA CSGLVRPNSG RVLLAGHDVS RAGPAARARR GLGRTFQRME LFDSLPVRDN VAAGAEGALA GGNPLTHLAS RPEDRVQVRR ATDDALEMCD LTALADTVTA KLSTGQRRLV ELARCLAGPF QILLLDEPSS GLDRVETVRF GEILRRVVAE RGVGILLVEH DMALVLDICE MIYVLDFGRL VFAGSPGEVV ASSVVQAAYL GDTAVEAAVP QTAQGEATVS GLVSDEEVVA
|
| |