Gene Franean1_0108 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0108 
Symbol 
ID5668533 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp126171 
End bp127487 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content72% 
IMG OID641239036 
Productputative ABC transporter 
Protein accessionYP_001504481 
Protein GI158311973 
COG category[C] Energy production and conversion
[P] Inorganic ion transport and metabolism 
COG ID[COG1668] ABC-type Na+ efflux pump, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.850247 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCCGG AATCCGATCA CGAACACGGC CGCGAGGCCA CGGCCGGCCC GGCCCCAGGC 
ATCCCGGCCC AGACCGGTGC CGGGCCGGCG CCGGCGCCGG GTTCCGGTGG CCGGGGCGAC
TATCCGGCGC TGGGCTCCTG GGCGACCGTG CGGCTCGTCG CCGGTCGCGA GCTCGGCGTA
CGGCTGCGTT CCAAGGTCTT CCGGATCACC ACCGTCGCCC TCCTCGTCCT GCTCGTCGGC
GCCGCGGTCG TGATCGATCT GGTCGACGGC GGCGATTCGA CGGAGTCCGT CGGCGTGACG
GCGGCGGAAT CCGCGATCAC GGCGCCGCTC ACCGCCGCCG CCCGGAGCCT GGACGTCGGC
ATCACGACCC GCGAGGTGCC CGACGAGGCC ACCGGGTTGC GCCAGGTCGC CGACGGCGAT
CTGGACGCCC TGGTGACCGC GTCCCCGAAC GGCCTGCGCG TTGCCGTCAA GGAGGACCTG
AACGACGAGT GGCGCGCCGT GCTGGCTGTT GTGGCCCGCC AGCAGGTTCT CGACAACGAG
ATCAGCGTGC TCGGCGGCGA CCCGGCCCGG GTGAACGAGG CGGTCGCCGC CACGCGGGTC
GATGTCACCC AGCTCGACCC GGCGCCGGCC CACCAGGGCG AGCGGCTCGT CCTCGGGATC
GCCGCCGCGT TGCTGATCTA CATGGGCCTG ATGCTCTACG GGCCGGCCGT TTCCCAAGGG
GTGGTCGAGG AGAAGTCGAG CCGGGTGGTC GAGCTGCTGC TGTCGACGGT CCGCCCGTGG
ACGCTCATGG CCGGCAAGGT GTTGGGTATC GGGCTGGTAG CGCTGATCCA GATGATCGTG
CTGGCCGGCG GGGGCCTGGT CGCCGCGCTG GTCACCGGCG CGCTCTCGCT GCCCTCCGGC
GAGGCCACCG GGACGGTGAT CTGGTCGGTG GTCTGGTACG TCATCGGGTT CTTCCTCTAC
GCCCTCCCGT TCGCCGCGGT CGGGGCGATG GTCTCCCGGC AGGAGGACGT CGGGGGCATC
TCGAGCCCCA TCGTGCTGGC GATCGTGGTG CCCTGGGTGC TGGGGATCTC GATCGTGCCG
GGCGATCCCG ACAACGGCCT GATCGCGGTG CTGTCCCTGC TGCCGATCTT CGCGCCGGTG
CTGATGCCGA TGCGGATCGC TCTCGGGGTG GCGCCGGTGT GGCAGTTGGT CCTCTCGGTG
GTGCTCGCCT TGGCGCTGAT CGGCGTGCTC ATCCGGCTCA CCGGGCGGAT CTACCGCAAC
GCGGTGCTGC GGACCGGGGC GAGGGTCTCC TTCCGGGACG CCCTGCGCGA GGCCTGA
 
Protein sequence
MSPESDHEHG REATAGPAPG IPAQTGAGPA PAPGSGGRGD YPALGSWATV RLVAGRELGV 
RLRSKVFRIT TVALLVLLVG AAVVIDLVDG GDSTESVGVT AAESAITAPL TAAARSLDVG
ITTREVPDEA TGLRQVADGD LDALVTASPN GLRVAVKEDL NDEWRAVLAV VARQQVLDNE
ISVLGGDPAR VNEAVAATRV DVTQLDPAPA HQGERLVLGI AAALLIYMGL MLYGPAVSQG
VVEEKSSRVV ELLLSTVRPW TLMAGKVLGI GLVALIQMIV LAGGGLVAAL VTGALSLPSG
EATGTVIWSV VWYVIGFFLY ALPFAAVGAM VSRQEDVGGI SSPIVLAIVV PWVLGISIVP
GDPDNGLIAV LSLLPIFAPV LMPMRIALGV APVWQLVLSV VLALALIGVL IRLTGRIYRN
AVLRTGARVS FRDALREA