Gene Franean1_3395 details


Gene Information

Locus tag: Franean1_3395
Symbol:
ID: 5671766
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4022954
End bp: 4025824
Gene Length: 2871 bp
Protein Length: 956 aa
Translation table: 11
GC content: 70%
IMG OID: 641242283
Product: ABC transporter related
Protein accession: YP_001507703
Protein GI: 158315195
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0411] ABC-type branched-chain amino acid transport systems, ATPase component
TIGRFAM ID:
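
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both are assumptions about this record's conventions, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(4022954, 4025824)
print(length)                     # 2871
print(protein_length_aa(length))  # 956
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.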


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACGATT TCTTGCCATT CATCGTCATC GGTCTTGCGA CCGGCGCGGT CTACGGCCTC 
GCTGGAATCG GCCTGGTGCT CACCTATAAA ACGTCGGGTA TATTCAACTT CGGCTACGGT
GCGGTAGCCA CGCTCGTCGC GTTCTGTTTC TATTTCCTGA ACGTCGATCA CGGCTGGCCG
TGGCCGCTCG CGGCGGCCGT CTCGCTTCTC GTGTTCGCGC CGTTGCTCGG CCTGGTGCTG
GAGCTGTTGG CCCGGTCGCT CAACGGAGCC AGCGAAACGA TCAAAGTCGT CGCGACCGTC
GGACTGATCC TCGTCGTGTC GAGCATCGGG CTGCTGTGGC ACCCGGTCAA TCCGCCGACC
TTCCCCCATT TCCTGCCGCA GGACACGGTG CGGATGGTCG GGGTGAACGT CACCTGGGAG
GAGATCATTC TCTTCCTGCT GTCGGCGGCG GCCGCTGCCG GGCTCTACTG GTTCTTCCGG
TCCGTCCGGT TCGGCATCGT CATGCGCGGT GTCGTCGACA ACCATGAGCT CATCTCCATG
AGCGGCGACG ACCCGGTACT TGTCCGCCGG GCGGCCTGGG TCATCGGCAG CGTCTTCGCC
GGCGTGGCCG GTCTGCTGCT CGCACCGTCG CACGACCTCG ACGGCGTCAC CCTGACGACG
ATCGTGTTCG CGGCCTTCGG GGCCGCCGCC ATCGGCTACT TCACCAACCT GCCGCTGACG
TTCGTCGGCG GCCTGGTGGT GGGTATCGCC AGTTCGCTGG TCGACAAGTA CTCCGCGACC
ATCACGTGGA TCGGCGGGCT GCCGCCGGCG CTGCCGTTCG TCATCCTGTT CGTGGCCCTC
ATCGTGCTGC CCCGGCGCCT GCTCGCGCAG CGGCGGCTGA CGGCCGTGCT GAGCACCCGG
CGCTCCTACC ACGCTCCGGT CCGGATCCGG CTGACGACGT TCGCGATCGC GATCGTCCTG
CTCGGCCTGG TACCGACGAT CCAGTCCGGC CACATCGCGG TGTGGTCGTC CGCGCTCATC
AACATCATGC TGTTCCTGTC GCTGGGCCTC CTTGTCCGGC GGTCCGGCCA GATCTCGCTG
TGCCACCTGG CGTTCGCCGC GGTCGGCGCG GCCGCCTTCG GGCACTTCTC CAACAGCATG
CCGTGGCTGC CGGCGCTCAT CCTGGCGACG CTGGTCGCCA TCCCGGTCGG TGCCCTGATC
TCGATTCCGG CCGTGCGGGT GTCCGGTGTG TTCCTCGCGC TGGCCACGCT GGGGCTGGGC
ATTCTCGCCG AGCAGGTCTT CTACACCCGG AACTTCATGT TCAGCCAGTC CGCGCTGGGC
ATCGAGGCAC CGCGGCCGGT CTTCTCCATC GGCAGTCTGG ACCTCTCGAG CGACAAGGGC
TTCTACTACC TGCTGCTGGT CATCACCGTG CTCGTCGCCG GCGTCATCAC CGCCATCGGC
CACGGCCGGC TGGGCCGGCT GCTCGAGGCG CTGGCGGACT CGCCGCTCGC GCTGGAGACC
CACGGGACGA CCTCGAGCGT CCTCAAGGTG ATCGTCTTCT GCGTCACCGC CGCGATCGCG
TCCCTGGCCG GGGCGCTCAA CGCGATGCTG TTCCACTTCG GCGTCGGCAC CTACTACCCG
TCGTTCAGCT CGCTGACCCT CGTCGCGCTG GTCGTGATCG TCACCATCGG CGATCCGTGG
TACGCACTCG TCGCGGCCGT CGGCTACAGC GTGCTCCCGG CCTACATCAC CGGCCAGAAC
ACCAGCACCG TCCTCAACCT GCTCTTCGGG CTCGGCGCCG CCACGGCCGC GTGGGGCACC
CGGGGCGGCG TCACACCCGC CCGCCTGCAG GCGCTGCTGG ACCGACTGGG CGGCAGGGCG
GCCCCTGTCA CCTTGGACGG CCTGGCCGCC GACGGCCTGG CCGCCGACGG CCTGGCCGCC
GACGGCCTGG CCGCCGACGG CCTGGCCGCC GACGGCCTGG CCGCCGTCGA CACAGCCGCC
GTCGACGCAC CGGCAACGCC CGGCGTGCCG GCCCGCCAGG TGCCCTCGCC CCGCCGCGAG
GCTCCCACCG GGGACGGCCT GACAGTCCGC GACCTGTCCG TCCGCTTCGG CGGCGTGCAC
GCCGTGAACG GGGTGACCCT CAAGGCCAGG CCCGGGGCCA TCACCGGTCT CATCGGGCCG
AACGGCGCCG GCAAGACCAC GACGTTCAAC GCCTGCAGCG GCCTGCTCCG GCCAAGCTCC
GGCGAGGTCC TCCTGCACGG CGCCAACGTC ACGGGGGAGG GGCCCGCCAG CCGGGCCCGG
CACGGGCTGG GACGGACGTT CCAGCGCACC GAGCTGTTCA ACAGCCTCAC CGTGCGGCAG
AACGTCGCCA TGGGCCGCGA GGCGTCCATG GCCGGCGCGA ACCCGCTCAA CCACCTGGTG
AGCTCGCGGC ACGCGAACCG TGTGGTCTCC GAGGTCGTCG AGGAGTCGCT CGCGCTCACC
GGAACCACCC GGATCGCAGA CCTGCAGGTC GGGCTGCTCC CGATCGGGCA GCGGCGCCTG
GTGGAGCTGG CCCGCGCCCT CGCCGGTCCG TTCGACATGC TGCTGCTGGA CGAGCCCTCC
TCCGGGCTGG ACGGCCACGA GACCGAGCAG TTCGGCCAGG TTCTCCAGAC CGTGGTTCGC
GAGCGCGGCT GCGGTGTCCT GCTCGTCGAG CACGACATGA GCCTGGTCCG GGAGATCTGC
GACTACCTGT ACGTGCTCGA CTTCGGGCAA CCGATCTTCG AAGGGACCCC AGACCAGATG
GAGAGCTCAG ACCAGGTCCG CAGCGCCTAC CTCGGCAGCG TGGCCGTTGC CGCGGACAGT
GCGGACGACC CATCCACCGA CCGGAGCATG CCGCTCCAGC CCCAGGAGTA G
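
GC content is the fraction of bases in a sequence that are G or C. A minimal sketch of how it could be computed from a sequence block like the one above follows; it assumes the spaces and line breaks are display formatting only, and the helper name `gc_content` is hypothetical. The example call uses only the first ten bases of the gene, not the full sequence.

```python
def gc_content(dna: str) -> float:
    """Fraction of G and C bases, ignoring whitespace used for display grouping."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


print(gc_content("GTGAACGATT"))  # 0.4
```

Passing the complete gene sequence to the same function would give a value to compare with the GC content field of the record.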
 
Protein sequence
MNDFLPFIVI GLATGAVYGL AGIGLVLTYK TSGIFNFGYG AVATLVAFCF YFLNVDHGWP 
WPLAAAVSLL VFAPLLGLVL ELLARSLNGA SETIKVVATV GLILVVSSIG LLWHPVNPPT
FPHFLPQDTV RMVGVNVTWE EIILFLLSAA AAAGLYWFFR SVRFGIVMRG VVDNHELISM
SGDDPVLVRR AAWVIGSVFA GVAGLLLAPS HDLDGVTLTT IVFAAFGAAA IGYFTNLPLT
FVGGLVVGIA SSLVDKYSAT ITWIGGLPPA LPFVILFVAL IVLPRRLLAQ RRLTAVLSTR
RSYHAPVRIR LTTFAIAIVL LGLVPTIQSG HIAVWSSALI NIMLFLSLGL LVRRSGQISL
CHLAFAAVGA AAFGHFSNSM PWLPALILAT LVAIPVGALI SIPAVRVSGV FLALATLGLG
ILAEQVFYTR NFMFSQSALG IEAPRPVFSI GSLDLSSDKG FYYLLLVITV LVAGVITAIG
HGRLGRLLEA LADSPLALET HGTTSSVLKV IVFCVTAAIA SLAGALNAML FHFGVGTYYP
SFSSLTLVAL VVIVTIGDPW YALVAAVGYS VLPAYITGQN TSTVLNLLFG LGAATAAWGT
RGGVTPARLQ ALLDRLGGRA APVTLDGLAA DGLAADGLAA DGLAADGLAA DGLAAVDTAA
VDAPATPGVP ARQVPSPRRE APTGDGLTVR DLSVRFGGVH AVNGVTLKAR PGAITGLIGP
NGAGKTTTFN ACSGLLRPSS GEVLLHGANV TGEGPASRAR HGLGRTFQRT ELFNSLTVRQ
NVAMGREASM AGANPLNHLV SSRHANRVVS EVVEESLALT GTTRIADLQV GLLPIGQRRL
VELARALAGP FDMLLLDEPS SGLDGHETEQ FGQVLQTVVR ERGCGVLLVE HDMSLVREIC
DYLYVLDFGQ PIFEGTPDQM ESSDQVRSAY LGSVAVAADS ADDPSTDRSM PLQPQE
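
The protein sequence corresponds to the gene sequence translated with translation table 11, in which alternative start codons such as GTG are read as methionine at the initiation position. The following is a minimal sketch of that translation, assuming the gene sequence is written 5' to 3' in coding orientation; the names `CODON_TABLE`, `START_CODONS` and `translate_cds` are illustrative, not part of any database tool.

```python
from itertools import product

# Amino acids in NCBI order for codons built from bases T, C, A, G
# (third position varies fastest). Table 11 shares these assignments
# with the standard code and differs in its set of start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display whitespace
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from this record.
print(translate_cds("GTGAACGATT TCTTGCCATT CATCGTCATC"))  # MNDFLPFIVI
```

The ten residues produced match the start of the protein sequence listed above.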