Gene Franean1_2907 details

Gene Information

Locus tag: Franean1_2907
Symbol:
ID: 5671294
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3423231
End bp: 3424532
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 66%
IMG OID: 641241814
Product: putative branched-chain amino acid transport system
Protein accession: YP_001507234
Protein GI: 158314726
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
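
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention, but it is the one consistent with the listed 1302 bp) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 3423231
end_bp = 3424532

# Inclusive coordinates: both endpoints count toward the length.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)      # gene length in bp
print(protein_length)   # protein length in aa
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.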


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.787747
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGGACA CAGCACTCCC GCAGGGCAGG TTGACGCGAA GGCGGTCACG ACGGCGGGCA 
GTGGGCACCC TCTTAGCCGC ACCGCTGGCC CTGCTCGTCG TCGCCGCCGC ATGCGGGACC
GACGACGGTG GAAACACCCC AGGCGCTGGC GGTGCAACCA CGAACTCCGA CGTGCTCGGC
CCCGTGAACA AGGCGGCCGG CGAGCCCGTC AAGATTGGGA TCATCTCGGA CGGCAAGGCA
CCCGCCTTCG ACAACTCCAT CCAGTTCGAC GTGGCGGACG CCACCGCCGC GTACCTCAAC
GAACACAAGG GAGGCATCGG CGGGCGGCCC ATCGAGCTGG TGAACTGCGA GACCCAGGCC
GACCCTGCAA GGGGCGCCGA CTGCGGGAAC CAGATGGTCG AGCACGACGT CGTCGCGGTC
GCCATCGGCG AGTCGGCGGT CGCCGAGAGC GTGTCGAAGC CGTTGGCCGA CGCGAACATC
CCGGCGATGT TCTTCGGCGC GGGTAGCCCG GCAGTGGTGG GCGACGCCGA ATCGACCTAC
GTCCTGGGCG ACCCGACCTA CGCGGTCCTC CAGCTACCGA TCGGCATCGC AAAAGACCAG
GGCGCGAAGA AGGTGACCGC AATCGTCATC GACGTGCCGG CCGCGCTCCA CACGTCGCGG
GAGGTCGCAC CAGCGGCCAT GGAGAAGGCG GGGCTCGACT ACGAACTCGT CACAGTGCCC
CCCGGCACGG CCGACATGAC ACCGCAGATG CAGAGCGTCG TCGATGGGGA CCCAGGCATC
GTGTTCGTCC TGGGCGGTGA CTCGTTCTGC ATCAGCGCGT TCAACGGACT GAAGGCAGTC
GGGTACGACG GCACGATCAG CGCCATCTCG CAGTGCATCT CCGACGCGAC CCGCAAGGCA
GTACCAGCGG ACGTACTCGA CGGGATGGTA GTCGGCGCCT CGGTGCCTGC CGGCGGGAGC
GATCCCTCCC GCGTCCTGTA CAGCACCGTT CTAGAGACCT ACGGCGGCAA TATCGACACC
AACTCAACCA CCGGCCGGGG CATGTTCGTG GTGTTCGACG GCCTCATAAC TGCGCTCGGA
GGCATAACGG GAGAAATAAC CACGGAGACG GCCAACGCGG CGATTAAGGC TATGCCGGAG
ACGGAACTAC CGGGAGCAGC ACTGTTGCAG TTCCGCTGCA ACGGAAAAGC ATATCCGGAA
ATGCCGGCGG TCTGTGTGCG CGGCGGGTTG TCCACGACTC TCAACAGCGA TGGTCAGCCC
ACAGAGTACA AGGTTATCGG CACCTCTCCG ATCCCAGACT GA
 
Protein sequence
MQDTALPQGR LTRRRSRRRA VGTLLAAPLA LLVVAAACGT DDGGNTPGAG GATTNSDVLG 
PVNKAAGEPV KIGIISDGKA PAFDNSIQFD VADATAAYLN EHKGGIGGRP IELVNCETQA
DPARGADCGN QMVEHDVVAV AIGESAVAES VSKPLADANI PAMFFGAGSP AVVGDAESTY
VLGDPTYAVL QLPIGIAKDQ GAKKVTAIVI DVPAALHTSR EVAPAAMEKA GLDYELVTVP
PGTADMTPQM QSVVDGDPGI VFVLGGDSFC ISAFNGLKAV GYDGTISAIS QCISDATRKA
VPADVLDGMV VGASVPAGGS DPSRVLYSTV LETYGGNIDT NSTTGRGMFV VFDGLITALG
GITGEITTET ANAAIKAMPE TELPGAALLQ FRCNGKAYPE MPAVCVRGGL STTLNSDGQP
TEYKVIGTSP IPD
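
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It builds the codon table from the standard NCBI amino-acid string for translation table 11 (bases ordered T, C, A, G) and translates the first 30 and the last 42 nucleotides of the gene sequence as listed in the record. The function name `translate` and the variable names are hypothetical; the sketch also ignores the alternative start codons that table 11 permits, which is harmless here because the gene begins with ATG.

```python
# Amino-acid assignments for NCBI translation table 11, codons ordered
# by base with T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-mers
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nt and last 42 nt of the gene sequence, copied from the record.
first_block = "ATGCAGGACA CAGCACTCCC GCAGGGCAGG"
last_block = "ACAGAGTACA AGGTTATCGG CACCTCTCCG ATCCCAGACT GA"

print(translate(first_block))
print(translate(last_block))
```

The last block is in frame because the 1260 nucleotides before it are a multiple of three. The two results correspond to the first ten and the last thirteen residues of the protein sequence shown above.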