Gene Franean1_0428 details


Gene Information

Locus tag: Franean1_0428
Symbol:
ID: 5668851
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 503841
End bp: 505589
Gene Length: 1749 bp
Protein Length: 582 aa
Translation table: 11
GC content: 70%
IMG OID: 641239360
Product: ABC transporter related
Protein accession: YP_001504799
Protein GI: 158312291
COG category: [E] Amino acid transport and metabolism
              [P] Inorganic ion transport and metabolism
COG ID: [COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component
        [COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components
TIGRFAM ID:
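
The Start bp, End bp, Gene Length, and Protein Length values above are related by simple arithmetic. The following is a minimal Python sketch of that relationship. It assumes 1-based inclusive coordinates and a final stop codon, which is consistent with the listed values but not stated in the record. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(503841, 505589)
print(length, protein_length(length))  # prints: 1749 582
```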


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGCCG AACGGGTAGT GGGCGACACC CGGCGCCGGG TGTTCCGGCA ACTCCTGCGC 
AACCCGCAGG GCGCCGTGTG CCTGGCCTTT CTCGCCCCGG TCGTGCTGGT CGCGGTGACC
AGCCAGTGGC TGGCGCCGTA CTCGCCGACC GAAACCGACC TCGACGCCAC CAACGCCGCG
CCGTTCAGCG ACGGGCACCT GCTCGGCGGG GACAGCGCCG GCCGCGACAT CCTGTCCCGG
CTCATGTGGG GCTCGCGGCA GACCGTCCTG GCGTGCGCCA TCATCCTGGT GATCTCGCTG
GCTGTCGGTG TCACCAGCGG CTTGGTGGCC GGCTTCTATC GTGGGCGGTT CGAGGTCGCG
GCGGGCTTCG TCTCCGACGT GGTCATGTCG CTGCCCGGCA TCGTTCTGCT GATCGCCCTG
TACGCCCTGA CCGGCCCGAA CATCCCGGCC GCGATGGCCG TCTTCGGCCT GCTGATCGCC
CCCACCTACT ACCGCCTGGT CCGCAACGTC GTGCTGGGCG TGCGCAACGA GCTCTACGTC
GACGCGGCCC GCGTGGCCGG CCTGTCCGAC CTGCGGATCG TCGGCCGGCA CGTGCTCTGG
GCCGTACGGG CGCCGGTCAT CATCCAGAGC TCGTTCGTGC TGGCCGCGGG CATCGGCATC
GAGGCCGGCG TGTCCTTCCT CGGCCTCGGT GACCCGGCCG GGGCGTCGTG GGGCATCGTG
CTGCAGAACT CGTTCAACGG CATCTACAAC AACCGGTGGG CCGTGGTCTG GCCGGCGCTG
CTGATCAGCC TCACCATCCT GGCCCTGGTG CTGCTCGGCA ACGCGCTCAG CGACGTCCTG
CAGTCCTCGG CCCGCAGCAA GACACTGTCG CTACGGGCCC GCCGCGCTGC GGTGACGGCG
GCGCAGCAGG TCGAGCCGAC CGAGGACTCG ATCCTCGTCG GGAGCTCCAA CGATGTCGTG
CTCTCCGTCC GGGGACTGCG GGTCGCCTAC CCGATGGCCG GCGGGGAGAT CCGGGAGGTA
GTGCACGGCG CCGACCTGGA TGTGCGGCGC GGCGAGATCC ACGGCCTGGT GGGGGAGTCC
GGCTCCGGCA AGTCGCAGAT CGCCTTCGCC ACCCTGGGCC TCCTCCCGCG CGAGGCGCTG
GTCCTGGGCG GCAGCGTCCT GCTCGACGGC GAGGATCTGC TGGCCGACGT CGCGAAGATG
CGGGCGGCCC GCGGGCGCCG CATCGCGTAC GTTCCGCAGG AGCCGATGTC CAATCTGGAC
CCGTCCTTCA CCATCGGCAA ACAGTTGACG TACGGGCTGC GCGCGGTAAC CACTCTCGAC
GCGAAACAGG CCCGGGAGCG GATCATCAAA TTACTGGTCC GGGTGGGCAT CACCGATCCC
GAGTGGGTGA TGGGCCTGTA CCCGCACGAG ATCTCGGGTG GCATGGCCCA GCGCGTGCTG
ATCTGCGGCG CGGTCGCCGC CGACCCGGAC GTCATCGTGG CCGACGAGCC CACCACCGCC
CTCGACGTCA CCGTCCAGGC CGAGGTCCTC GAACTGCTGC GCGAACTGAG CCAGGAGCGC
GGTCTGGCCA TGATCCTGGT GACCCACAAC CTCGGCGTCG TCGCGGATCT GTGCGACACC
GTGAGCGTGA TGAAGGAAGG AAACATCGTC GAACGCGCCG ACGTCGACGC CATCTTCGAG
TCCCCGCAAC AGGCGTACAC CCGGGAACTG CTCTCGTCCT CCCGCAGCGT CGAGCTGATG
GAGATCTGA
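
The gene sequence above is laid out in space-separated blocks of ten bases, so it needs to be joined before any computation such as checking the GC content field. The following is a minimal Python sketch, not the method used by the source database; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-separated sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence above, used as sample input.
first_line = "ATGACCGCCG AACGGGTAGT GGGCGACACC CGGCGCCGGG TGTTCCGGCA ACTCCTGCGC"
seq = clean_sequence(first_line)
print(len(seq))                        # 60 bases on this line
print(f"{gc_fraction(seq):.0%} GC")    # GC percentage of this line only
```

Passing the full multi-line gene sequence to `clean_sequence` gives a single string whose length can be compared against the Gene Length field.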
 
Protein sequence
MTAERVVGDT RRRVFRQLLR NPQGAVCLAF LAPVVLVAVT SQWLAPYSPT ETDLDATNAA 
PFSDGHLLGG DSAGRDILSR LMWGSRQTVL ACAIILVISL AVGVTSGLVA GFYRGRFEVA
AGFVSDVVMS LPGIVLLIAL YALTGPNIPA AMAVFGLLIA PTYYRLVRNV VLGVRNELYV
DAARVAGLSD LRIVGRHVLW AVRAPVIIQS SFVLAAGIGI EAGVSFLGLG DPAGASWGIV
LQNSFNGIYN NRWAVVWPAL LISLTILALV LLGNALSDVL QSSARSKTLS LRARRAAVTA
AQQVEPTEDS ILVGSSNDVV LSVRGLRVAY PMAGGEIREV VHGADLDVRR GEIHGLVGES
GSGKSQIAFA TLGLLPREAL VLGGSVLLDG EDLLADVAKM RAARGRRIAY VPQEPMSNLD
PSFTIGKQLT YGLRAVTTLD AKQARERIIK LLVRVGITDP EWVMGLYPHE ISGGMAQRVL
ICGAVAADPD VIVADEPTTA LDVTVQAEVL ELLRELSQER GLAMILVTHN LGVVADLCDT
VSVMKEGNIV ERADVDAIFE SPQQAYTREL LSSSRSVELM EI