Gene Information |
Locus tag | Francci3_2803 |
Symbol | |
ID | 3904949 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 3301339 |
End bp | 3302637 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637880124 |
Product | general substrate transporter |
Protein accession | YP_481890 |
Protein GI | 86741490 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGCTT CGATGACAGC AACAACCGGC CTCCGGGAGC GGACCAACCA CTATCGGCAG CTCCTGGCCG CCTCCGTCGG CAACGCCGTC GAATGGTTCG ACTGGTACAT CTACTCGATG CTTGCGGTGT ACTTCTCCAC GCAGTTCTTC CCGGCCAGCC CGGGAGGCAG CCTGGTGCCG CTGATGTCGA CCTTAGCGAT CTTCGCCGTC GGATTCTTCG CCCGCCCGGT CGGAGCGCTG ATCCTCGGGA TTCTCGCCGA CCGGATCGGC CGCCGCAGGA CGCTCAGCAC CACGATTATG GGCATGGGCC TGGGCAGTCT CATCATCGGG CTTGCCCCCA CCTACAAGCA GGTCGGGGTC GCCGCCCCGG CACTGCTCCT GCTGGCCCGG CTGATCCAGG GTGCCTCCGC GGGGGGCGAG TATGCCGCGG GAAGCGCGTT TCTCATCGAG TCCGCGCCGG CTGGCCGTCG GGGACTGTAC TCGAGCTTCT TCTACATCAG TGCGACCTCG GCCAACCTGG CGGCGATCGG GATCAGCGCC TTACTCGCCA GCACGCTGGA CTCGCAGAAC ATGACCAGCT GGGGCTGGAG GATCCCCTTC CTGCTCGGCT CGGTCGCGGC CATGGTCGGC ATGTGGATCC GGACGCACGC CAAGGAGACC CTCACCCAGA CGGCGGACGA GCCGGCCGGT GCCCGTCGTG CCGACATGTT CGAGTTCCTC CGCCGGCACC CCAGGGAATC CCTCCAGGTG TTCGGGCTGA CCGCGGCCCC GGCCCTGGTG TTCTATGTAT GGACCGCCTA TCTGCCGACG TACGCGAACA TCACGGTGGG CTCCAACCTC AAGCACGGCC TGCTCTCCGG TGTCGTCGCG CTGACCATCT TCCTCGCCCT GCAGCCGGTC TTCGGAGCGA TCTCGGACCG CGTCGGCAGG CGGCCGATGC TGCTCATCTT CGGCACCTTC TTCGTGGTCG GCACGGTTCC GATGCTGGCC CTACTGGACG GCTCGACCAC CAGACTGCTC CTGGTTCAGA TCATGGGGCT GGTGTTCCTC GCCTGCTGGT CGTGCATCTC GAGCGCGGTG GTCGCCGAGC TGTTCCCGGC GCGCCTGCGC AGCTCCGGTA TCGGGTTCCC CTATGCGCTC TCCGTCGCCC TCTTCGGCGG CACCGGCCCC TACGTCGCCA CCTACCTCGT GGACATCGGC CACGCCGCCT CCTTCGGCTG GTACGTCACC GCCGTCGCCC TGGTCAGCAC CGTGGTCTTC AGCCGGCTCC CCGAGACCGC GCACCGGCCC CTGCACTGA
|
Protein sequence | MTASMTATTG LRERTNHYRQ LLAASVGNAV EWFDWYIYSM LAVYFSTQFF PASPGGSLVP LMSTLAIFAV GFFARPVGAL ILGILADRIG RRRTLSTTIM GMGLGSLIIG LAPTYKQVGV AAPALLLLAR LIQGASAGGE YAAGSAFLIE SAPAGRRGLY SSFFYISATS ANLAAIGISA LLASTLDSQN MTSWGWRIPF LLGSVAAMVG MWIRTHAKET LTQTADEPAG ARRADMFEFL RRHPRESLQV FGLTAAPALV FYVWTAYLPT YANITVGSNL KHGLLSGVVA LTIFLALQPV FGAISDRVGR RPMLLIFGTF FVVGTVPMLA LLDGSTTRLL LVQIMGLVFL ACWSCISSAV VAELFPARLR SSGIGFPYAL SVALFGGTGP YVATYLVDIG HAASFGWYVT AVALVSTVVF SRLPETAHRP LH
|
| |
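The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive positions, and that the CDS ends in a stop codon that is not counted in the protein length (the listed gene sequence does end in TGA). The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced here for illustration.

```python
# Values copied from the Gene Information table of this record.
start_bp = 3301339
end_bp = 3302637
gene_length_bp = 1299
protein_length_aa = 432


def span_length(start, end):
    """Length of a closed interval [start, end], assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residue count for an unspliced CDS whose final codon is a stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Compare the derived values with the values reported in the table.
coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print("coordinates match gene length:", coords_consistent)
print("gene length matches protein length:", protein_consistent)
```

Under these assumptions, 3302637 - 3301339 + 1 gives the reported 1299 bp, and 1299 / 3 - 1 gives the reported 432 aa.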