Gene Franean1_4038 details

Gene Information

Locus tag: Franean1_4038
Symbol:
ID: 5672396
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4814917
End bp: 4817367
Gene Length: 2451 bp
Protein Length: 816 aa
Translation table: 11
GC content: 71%
IMG OID: 641242914
Product: inner-membrane translocator
Protein accession: YP_001508331
Protein GI: 158315823
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0559] Branched-chain amino acid ABC-type transport system, permease components
        [COG4177] ABC-type branched-chain amino acid transport system, permease component
TIGRFAM ID:
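
The start, end, gene length, and protein length fields are consistent with one another, which can be checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted as a residue. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 4814917
end_bp = 4817367

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 2451 816
```

Under these assumptions the computed values match the listed Gene Length (2451 bp) and Protein Length (816 aa).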


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.354929
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTCGCCCA GCAGACGGGC GGCTGCTGAC TTGAACCAGA CCTCCGTCTC CCACGACTTC 
TTCCAGGCCC TCCTTCAGGG AGTACCGCCG GGAGCGGTCT ATGCCCTCGT CGCGTTGGGC
TTCGTCCTGA CGTACAAGAC CTCGGGTGTT TTCAACCTTG CCTTCGGCGC CCAGGCCTAT
GTCTCGGCAG CCATGTACTT CAAGGCCCGG GTCATCTGGG AGTGGCCGAC CGTCCCGTCG
GTCATCCTCG CGGTCTTCGT GATGGCCCCG CTGATCGGGC TGATCCTCGA ACGGCTGATC
TTCCGGCCGC TGCGGACGGC ACCGGCGGTG GCGCGGCTCG TCGTGGCCGC CGGCCTTGCG
GTCGCGATTC CCTATCTGTT CGACATCCTG GCCGACTTCA CCGCGGTCGC CGGCGTGACG
CCGGTCGGTG TCGTTCCCGA CGGCGCCAAC GTCTTCTACG ACCCGTTCGG GGTCTACGCG
TACAGCCGCA ACGAGCTCGT CAGCATGGGC GTCGCCCTGG TCGCGATGGC AGGGCTGGTC
GCGCTGTTCC GCTTCAGCGC CATCGGCGTC CGCATGCGCG CGGTGGTCGA GAGCCCGAAG
ATGACCGAGC TCAACGGTAT CCCCGCCGAC CGGGTCTCGG CCCTGTCGTG GGCGCTGTCG
TCGCTGTTTG CCGGTGTCGC CGGCGTGCTC ATCGCGCCCC GCTTCAACAC GCTCGCGGCC
GCGGACTTCT TCAGCCTCAT GGTGGTCGCG ATCGCCGCGG CCGCGGTGGG CCGGCTGACG
AGCCTGCCGA AGGCGATGGC CGGCGGCCTG CTGCTCGGCA TCATCATCGC CCAGCTCAAC
ACCTTCCTCC CGCGCTGGTC CGACGACCAC GCCTGGGTCT CGACGATCCA GGACAACCTG
ACGCCGTCGG TGCCGTTCGT GGTGCTCTTC GGCGTCGTCG TGCTCGTCCC GAGCATCCGG
CGCTCCCGGG AGACCGGTGA CCCGCTGGCC GGGGTCGAGC CGCCGCCGCC GTCGCTCGGC
GGCGAGGTGC GCGACCCGCG CCGGGCCCTG ATCACGAGGC TGATCGGCTT CGCCGCGCTC
GGCGTCGTCG CCATCGTGGT GCTGGCCCGC GGCGACCAGC TGTGGGTCTT CCTGGTGACG
CAGGCCGTGG TGATCGGCAT CATCTTCCTG TCGATCACCG TGATCACCGG CATGGCCGGC
CAGATCTCGC TGTGCCAGGG GACGTTCGCC GCCATCGGGG CGTTCACGGT CTTCCAGCTC
GTCGACCGCT ACAACCTCTC GGTCCTGATG GCGGCGCTGA TCGGCGCCGC CATCGCGGCC
GTGGTCGGCG CGGTCCTGTC GCTGCCCATC CGCAAGCTGG GCGGCATCTG GACGGCGATC
GCGACCCTGG CGTTCGCCTA CTTCTTCGAC GCCGTGTTGG TGAAGCTCTC CTGGATCGGT
GGCGGGGACT CCGCGCTGCT GCAGGGCACC GCCGTGCCGC GCCCGGTCAT CGGGCCCTGG
GACCTGGCGG ACGACAAGTA CTACCTGGTG TTCGCCTCGG TGATCCTCAT CGTGGTCGCC
ATGGTCGTCC TCCAGCTCCG CAAGGGTACC TTCGGCCGCA CCCTCGTCGC CCTGCGCGGC
AGCGAGGTCG GCGCCGAGTC GATCGGCATC TCCGCCGGCC GGGCCCGCCT GGTCGCCTTC
GCGGTCTCCG CGTTCATCGC GGGACTCGGT GGCGCGCTGC TGGCGATCCA GCAGGAGAAC
GTCAACTACG GCACGAACTT CGTTCCCTTC GCCGCCCTGT TCTGGGTGGT GATCGTGGTG
ACGCTCGGCT CGCGCACGGT GCGCGGCGCG CTGAACGCCG CCGCGTCCTT CGCGGTGTTC
GACCAGCTGA TCCTCAAGGG GACGGTGTTC GCCTGGATCC TGCGCAGCCC GACCGCCATA
CCGGACTTCT TCCCGATCTC CGGCAAGTGG GTCTACGTGT TGTTCGGCCT GGCGGCCATC
CAGTTCGCCC GGCATCCGGA GGGCCTGGTG GAGCGGCCGG CGAGCATGCC GAAGTTCATC
ACCAAGCTGG TGGCACTGGC CCGCCCCGCG ACCCCGGCCG CGGTGCCGGT GGCCGTCGGC
GCGGGCGCGG CGGGATCGGC TCCCGAGCCG GACACCACCA CACCCACGGG CACCTCGGCG
AAGGCGGCCG CGCCGCCCGC CGCGAAGCCC GCGGAGCCCG CACCCGAGGC ACCACCCGCC
GTGAAGTCGG AACCAGCGCC GAAGTCGGAA CCAGCGCCGA AGTCGGAAGC CACCTCGACG
GCGAAGCCGG CGGCGGCCAC GGAAGCCGCT CCGGCGGCGG CGCAGCCCGC CGCCCCGGTT
GCCACAGCTG AACCCGCGGC CGGTGACGGG TCGCCGAAGG CCGCGGCCAA CGGAGGGTCT
GAGCGTCCGG CGGCTTCCTC GCCCGCGCGG ACGGAGGACG CGGTCTCGTG A
 
Protein sequence
MSPSRRAAAD LNQTSVSHDF FQALLQGVPP GAVYALVALG FVLTYKTSGV FNLAFGAQAY 
VSAAMYFKAR VIWEWPTVPS VILAVFVMAP LIGLILERLI FRPLRTAPAV ARLVVAAGLA
VAIPYLFDIL ADFTAVAGVT PVGVVPDGAN VFYDPFGVYA YSRNELVSMG VALVAMAGLV
ALFRFSAIGV RMRAVVESPK MTELNGIPAD RVSALSWALS SLFAGVAGVL IAPRFNTLAA
ADFFSLMVVA IAAAAVGRLT SLPKAMAGGL LLGIIIAQLN TFLPRWSDDH AWVSTIQDNL
TPSVPFVVLF GVVVLVPSIR RSRETGDPLA GVEPPPPSLG GEVRDPRRAL ITRLIGFAAL
GVVAIVVLAR GDQLWVFLVT QAVVIGIIFL SITVITGMAG QISLCQGTFA AIGAFTVFQL
VDRYNLSVLM AALIGAAIAA VVGAVLSLPI RKLGGIWTAI ATLAFAYFFD AVLVKLSWIG
GGDSALLQGT AVPRPVIGPW DLADDKYYLV FASVILIVVA MVVLQLRKGT FGRTLVALRG
SEVGAESIGI SAGRARLVAF AVSAFIAGLG GALLAIQQEN VNYGTNFVPF AALFWVVIVV
TLGSRTVRGA LNAAASFAVF DQLILKGTVF AWILRSPTAI PDFFPISGKW VYVLFGLAAI
QFARHPEGLV ERPASMPKFI TKLVALARPA TPAAVPVAVG AGAAGSAPEP DTTTPTGTSA
KAAAPPAAKP AEPAPEAPPA VKSEPAPKSE PAPKSEATST AKPAAATEAA PAAAQPAAPV
ATAEPAAGDG SPKAAANGGS ERPAASSPAR TEDAVS
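
The protein sequence follows from the gene sequence under translation table 11, listed in the Gene Information fields. The following is a minimal sketch of that relationship, not the method the database uses. It assumes the gene sequence is given in coding orientation and in frame, and that the first codon is the initiator and is therefore read as M (here the first codon is TTG, which otherwise codes for L). The function name `translate_cds` and the constant names are hypothetical. The amino-acid string is the standard codon assignment in TCAG order, which table 11 shares with the standard code.

```python
# Codon assignments in TCAG order (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    usable = len(dna) - len(dna) % 3    # ignore a trailing partial codon
    residues = []
    for index, pos in enumerate(range(0, usable, 3)):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        # Assumption: the first codon is the initiator and is read as M.
        residues.append("M" if index == 0 else amino_acid)
    return "".join(residues)


# First row of the gene sequence shown above.
first_row = "TTGTCGCCCA GCAGACGGGC GGCTGCTGAC TTGAACCAGA CCTCCGTCTC CCACGACTTC"
print(translate_cds(first_row))  # MSPSRRAAADLNQTSVSHDF
```

The result matches the first 20 residues of the protein sequence. Any character other than A, C, G, or T raises a `KeyError` in this sketch, so ambiguous bases would need separate handling.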