Gene Franean1_3922 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3922 
Symbol 
ID5672283 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4689701 
End bp4690654 
Gene Length954 bp 
Protein Length317 aa 
Translation table11 
GC content67% 
IMG OID641242801 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_001508218 
Protein GI158315710 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0601] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0207094 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCTCA CTGTTGCGAG ACGTGTAGGC CGTAGCCTGC TGGTGGTCTT ACTCGTGACC 
ATGGGGGCCG TGGCGCTACT GAGCCTGGCT CCGGGGTCAG CGGCCTCCGT CATCCTGGGG
GAGAACGCGA CACCGGACGC CATCGCCGCT CTGAACGCCA AGCTGGGACT CGATCGGCCT
CTGTGGTCGC AGTACACGCA CTGGCTCGGC AACGCCGTCA CCGGCGACCT GGGCACCTCC
CCGGTGACCA ACCAACCCGT CCTGGACGCG ATCGTCGAGC GGCTGCCGGT CACCCTGCAG
CTCGCCGCGA TGGCTCTGGT CTTCGCGCTG CTTGTCGCCG TGCTGCTGGC GGTGGTGTCC
GCGAGCTGGC CGGGCACTCC GATCGACCGG GCGATCACCG CGCTGTCCTC GGTGTTCCTC
TCCGTACCCG CCTTCATCGC GGGCCCGGTC ATGATCTACT TCTTCGCGCT GCAGGCCGGG
TGGTTCCCGG TGACGGGCTG GTCGCGCATC AGCGAGGGCC TCGGAGACAA CCTCCGCAGC
GCCATCCTGC CCGTCCTCGC GATCTCCCTG ACCGAGATCG CGTCCTTCCA CCGGCTGCTG
CGCACCGACC TGGTCGGCAC GCTGCGTGAG GACTTCATCG GGGCGGCCCG CGCCAAGGGC
ATGAGCCCGT CCTACGTGAT GTCCCGTCAC GCCCTGCGGC CGTCGTCGTT CTCCCTGGTC
ACCCTCGTGG GCATCAACCT GGGCCGGCTG ATCGGCGGAA CGGTGATCGT GGAATCCCTG
TTCTCGCTGC CGGGCCTGGG CCAGCTCGTC GTCTACTCGA TCACTGCCCG CGACATCATC
ACCGTCCAGG GCATCGTCGT GTTCATCGCC GTGGTCTACG TGGTCATCAA CATGCTCGTG
GATCTCAGTT ACGGCTGGCT CGACCCGCGC GTCAGGAAGG CTGCCAACGC ATGA
 
Protein sequence
MLLTVARRVG RSLLVVLLVT MGAVALLSLA PGSAASVILG ENATPDAIAA LNAKLGLDRP 
LWSQYTHWLG NAVTGDLGTS PVTNQPVLDA IVERLPVTLQ LAAMALVFAL LVAVLLAVVS
ASWPGTPIDR AITALSSVFL SVPAFIAGPV MIYFFALQAG WFPVTGWSRI SEGLGDNLRS
AILPVLAISL TEIASFHRLL RTDLVGTLRE DFIGAARAKG MSPSYVMSRH ALRPSSFSLV
TLVGINLGRL IGGTVIVESL FSLPGLGQLV VYSITARDII TVQGIVVFIA VVYVVINMLV
DLSYGWLDPR VRKAANA