Gene Franean1_3478 details


Gene Information

Locus tag: Franean1_3478
Symbol:
ID: 5671849
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4135581
End bp: 4136531
Gene Length: 951 bp
Protein Length: 316 aa
Translation table: 11
GC content: 72%
IMG OID: 641242366
Product: binding-protein-dependent transport systems inner membrane component
Protein accession: YP_001507786
Protein GI: 158315278
COG category: [E] Amino acid transport and metabolism; [P] Inorganic ion transport and metabolism
COG ID: [COG0601] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components
TIGRFAM ID:
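
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes an unspliced CDS whose length is a multiple of three and
    whose last codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length_bp = gene_length(4135581, 4136531)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.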


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.112233
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.156031
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACCCGG CCATCCGTCT GGTGGCGGTC CGCCTGCTCG GCGCCGTCCT GGTCATCTGG 
GGCGCGGTCA CCGCCGCGTT CGTGGTGCTC CAGCTCATAC CCGGGGACCC GATCAACGCG
ATCATCGGAA CGCACGCGCT GGTCGGGCCG GAACAGCGTG CTCAGCTCCG GGCCGAATAC
GGCCTGGACG ACTCGCTGTT CGCGCAGTAC CTGGACCACA TGGGACGGCT GGCCACCGGT
CGCCTCGGCG ACTCCTACCA GCTCCAGCAG CCGGTCTGGA CAGTGATCAC CGACCAGGCC
GGCGCCACCG TCGAGCTCGC CGGATGGGCG ATGTTCTCCG CGGTGGTGCT CGCCGTGGCC
GTGACCCTGC TGACCTCCGG GCGGGCCCGC TGGCCGCGCC GGATCAGCTC GCTGCTGGAG
CTGGTCGTCG TCTCCACACC CCAGTTCTGG CTGGGGATCC TGCTGCTCAC CGTCTTCTCG
TTCCATCTCG GCTGGTTCCC GGTGGCCGAC ACCGGGGACC CGCGTTCGCT GATCCTGCCG
GTGGTGACCC TCGCGCTGCC GATCGCGGCG GTGCTCATCC AGGTGATGCG CGAGGGGCTG
CTCTCCGCTC TGGAGGCGCC CTTCGTGCTG ACCGCGCGGG CCCGCGGCAG CGCCGAGTAC
TCGGTGCGCG CGCGACACGC GCTGCGGCAC GCGAGCCTGC CCGCGCTGAC CCTGTCGGGC
TGGTTCGTCG GCACGCTGCT CGGCGGGGCC GTGATCACGG AGAACGTCTT CGCCCGCTCC
GGCATCGGGC GGGTGACCCT GCAGGCGGTC GCCAACCGGG ACTTTCCAGT CGTGCAGGGG
GTGGTCGCGC TGTCGGCGGT GGTGTTCGTC GCCGTCAGCG CCCTGCTGGA ACTGCTGTAC
GCGGTGGTCG ACCCGCGGCT GCGCAAGCGG ACGGGGGTGG CCGCGGCATG A
 
Protein sequence
MHPAIRLVAV RLLGAVLVIW GAVTAAFVVL QLIPGDPINA IIGTHALVGP EQRAQLRAEY 
GLDDSLFAQY LDHMGRLATG RLGDSYQLQQ PVWTVITDQA GATVELAGWA MFSAVVLAVA
VTLLTSGRAR WPRRISSLLE LVVVSTPQFW LGILLLTVFS FHLGWFPVAD TGDPRSLILP
VVTLALPIAA VLIQVMREGL LSALEAPFVL TARARGSAEY SVRARHALRH ASLPALTLSG
WFVGTLLGGA VITENVFARS GIGRVTLQAV ANRDFPVVQG VVALSAVVFV AVSALLELLY
AVVDPRLRKR TGVAAA
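
The relationship between the gene sequence and the protein sequence, and the GC content field, can be reproduced with a short script. This is a sketch only: the helper names are hypothetical, the codon table is built by hand rather than taken from a library, and it uses the standard codon-to-amino-acid assignments. Those assignments are also used for elongation under translation table 11, which differs mainly in the set of permitted start codons, so no special handling of alternative starts is attempted here. Whitespace is stripped because the sequences above are printed in blocks of ten.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons and last four codons of the gene sequence above.
print(translate("ATGCACCCGG CCATCCGTCT G"))
print(translate("GCCGCGGCAT GA"))
```

Passing the full gene sequence to `translate` and `gc_fraction` allows a comparison with the protein sequence and the GC content reported in the record.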