Gene Franean1_5891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5891 
Symbol 
ID5674213 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7153452 
End bp7154708 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content71% 
IMG OID641244740 
ProductABC transporter related 
Protein accessionYP_001510142 
Protein GI158317634 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1134] ABC-type polysaccharide/polyol phosphate transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.559875 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGTCGC AACCGCTCGC CCCGCCCCCG ATGCAGCCCG CGCAACCGGC CGCCTCCGCG 
CCGTCGGCGC CGCCGGCACC GGGCGCCCCG GTGGTCATCC GCGCCTCCGG CGTCGGCAAG
AAGTTCGTCG CCTATCACAA GCGCGCCACC AGCCTGAAGG AACGCTTCGT CCGGCGGGAC
ACGACGAGCG GCGAGGACTT CTGGGCGCTG CGCGACATCG ACGTCGAGAT CGGCCGCGGG
CAGACGGTGG GGCTCGCCGG GGCGAACGGC TCGGGCAAGT CGACGCTGCT CAAGGTACTC
GCCGGAATCC TGCGCCCGAC CCACGGCGAC GTGTCCGTCA GTGGTCGGAT CGCGTCCCTG
CTCGAGCTGG GCGCGGGGTT CAACGGCGAG CTCTCCGGCC GGGACAACGT CTACCTCAAC
GCGTCCCTGC TCGGCCTGTC CAAGCGCGAG ATCGACCGGC TCTTCGACTC GATCGTCGAC
TTCTCCGAGC TGCGCCACAA GATCGACGAC GAGGTCAAGC ACTACTCGTC CGGCCAGTAC
GTGCGCCTCG GCTTCGCCGT GGCCGTGCAC GTCGACCCGG ACGTCCTGCT CGTCGACGAG
GTGCTCGCCG TCGGCGACGA GGCGTTCCAG CGCAAGTGCC TGGCCAAGAT CGCCCAGTTC
CGCGACGAGG GCCGGACGAT CCTGTTCGTC ACCCACTCCC TCGACCTGAT CGAGAACATG
TGCGACCGGA TCCTGGTGCT GGAGTCCGGC GGCCTGATCT TCGACGGGCA GCCGGTCGTC
GGGACGAAGC TGCTGCGCCA GCGGCTGGGC AGCCTGCCCG CGGACAGTCC GATCCCCTTC
GACCTGGCGC CGGTGAAGCC GACGGCGGTG GCGTTCAGCC GCAACCCGGG CGGGCCGACC
GAGGTGCAGT ACGACCCGGG TGAGCAGCTC ACGGTCAGCG TCGAGCTCGA CCTCACCGAC
AGCGCGCCGC AGTACGTCTA CCTGCACGTG GAGATCATGG GGCAGGGAGA GGTACCGATC
TGGATGATGG AGACCCCGCC GGGCGGGGTC AGCCCCGGCC CGGGGACCGC CGTGGTGGAC
TTCGTCGTCC CCCGGCTGCC CGAGCTGCTG GGGGCGTTCG CGATCAACGT CCGGGTGTCG
GACGCGGCCA CCGGCCAGCC GGTGACGGTA CGGCGCTTCG AGGAGCTGTT CGGGGTCAGC
GGCCCGCAGG TCGCCGGGCT GCTCAAGGTC GACTACGAGG CGAGGCTGCG CCGATGA
 
Protein sequence
MPSQPLAPPP MQPAQPAASA PSAPPAPGAP VVIRASGVGK KFVAYHKRAT SLKERFVRRD 
TTSGEDFWAL RDIDVEIGRG QTVGLAGANG SGKSTLLKVL AGILRPTHGD VSVSGRIASL
LELGAGFNGE LSGRDNVYLN ASLLGLSKRE IDRLFDSIVD FSELRHKIDD EVKHYSSGQY
VRLGFAVAVH VDPDVLLVDE VLAVGDEAFQ RKCLAKIAQF RDEGRTILFV THSLDLIENM
CDRILVLESG GLIFDGQPVV GTKLLRQRLG SLPADSPIPF DLAPVKPTAV AFSRNPGGPT
EVQYDPGEQL TVSVELDLTD SAPQYVYLHV EIMGQGEVPI WMMETPPGGV SPGPGTAVVD
FVVPRLPELL GAFAINVRVS DAATGQPVTV RRFEELFGVS GPQVAGLLKV DYEARLRR