Gene Franean1_4828 details

Gene Information

Locus tag: Franean1_4828
Symbol:
ID: 5673169
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5767683
End bp: 5768669
Gene Length: 987 bp
Protein Length: 328 aa
Translation table: 11
GC content: 73%
IMG OID: 641243684
Product: binding-protein-dependent transport systems inner membrane component
Protein accession: YP_001509100
Protein GI: 158316592
COG category: [E] Amino acid transport and metabolism; [P] Inorganic ion transport and metabolism
COG ID: [COG0601] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components
TIGRFAM ID:
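
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that the coordinates are 1-based and inclusive and that the CDS ends with a stop codon that is not counted in the protein length; the function names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information table.
START_BP = 5767683
END_BP = 5768669


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS whose last codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints "987 328"
```

Under these assumptions the 987 bp span gives 329 codons, of which 328 encode amino acids, matching the table.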


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00770957
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0433123
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGACGCGCT ATCTGTTCGG GCGGTTCCTG CAGGGGGCCT TCGTTCTCTG GGCCGCGTTC 
ACGCTCTCGT TCGTCGTGCT GTACGCGCTG CCGAGCGATC CGGCGGCGAT CATGATCGGG
CCGAGTAACT CCCTCACCCC GGCCGAGCTC GCGGCCCGCC GGCACGAGCT CGGCCTGGAC
CGGTCCCTGT TCGCGCAGTA CTTCGGGCGG CTGGGCGACC TCCTCCACGG CGACCTGGGC
CGGTCGGTGC AGTCCGGTCA GCCGGTGCGG GAGCTGATCG GGGACGCCCT GCCGCAGACC
GCCGCCATCA CGGGGCTCGG CCTGGTGGTC GGGGTGGCGC TCGGCGTGGG TCTCGCGGTC
GCGGCGACGC TGACCCACCG GCGCTGGCTG CGCCAGACGC TGCTCACCCT CCCCCCACTC
GGCGTGGCCG TCCCGAGCTT CCTCGTCGGG CTCCTGCTGC TGCAGTGGTT CTCGTTTCGG
TGGCAGCTGT TCCCGGCCAT CGGCAACAGC GGCTGGCGCA GCCTGGTGCT CCCCGCCGTC
ACGATCTCGC TGCAGCCCGC CGCCCTGATC GCGCAGCTGC TGGCGCGGAG CCTGGACAAC
GAGCTGCGGC AGAACTACGT CGACCTGGCC AGGGCGAAGG GCGCCGGACC GGCGCGGGTG
AACGTCCGGC ACGCGCTGCG CAACGCCGCG CTGCCGGCGC TGACCATCGC CGGCATCCTG
GTCGGCGGAC TGCTGGCCGG CGCGATCGTG GTGGAGGTCG TGTTCTCCCG CAACGGCCTC
GGCCGGATCT CCCAGGCCGC GGTCGACAGC CAGGATCTGC CGGTGGTGCA GGGCGTGGTC
CTGCTCGGCT CGGCGGTCTT CGTCGCCGTC AACCTTCTCG TCGATCTGCT CTATCCGCTG
CTCGACCCCC GCATCGCCCG GGACGGGAGC GCGGCCCGCC GCGCTCCGGA GGCGGTCGCG
GTCGGTCCGG CCCCGTCCGT AACGTGA
 
Protein sequence
MTRYLFGRFL QGAFVLWAAF TLSFVVLYAL PSDPAAIMIG PSNSLTPAEL AARRHELGLD 
RSLFAQYFGR LGDLLHGDLG RSVQSGQPVR ELIGDALPQT AAITGLGLVV GVALGVGLAV
AATLTHRRWL RQTLLTLPPL GVAVPSFLVG LLLLQWFSFR WQLFPAIGNS GWRSLVLPAV
TISLQPAALI AQLLARSLDN ELRQNYVDLA RAKGAGPARV NVRHALRNAA LPALTIAGIL
VGGLLAGAIV VEVVFSRNGL GRISQAAVDS QDLPVVQGVV LLGSAVFVAV NLLVDLLYPL
LDPRIARDGS AARRAPEAVA VGPAPSVT
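
The gene and protein sequences can be cross-checked against each other and against the reported GC content with a minimal sketch like the one below. It is an illustration, not the database's own method: the helper names are hypothetical, only the first 60 bases of the gene sequence are included to keep the block short, and the codon table is the standard one, whose amino acid assignments are assumed here to match translation table 11 (which, as far as is commonly documented, differs from the standard table only in the permitted start codons).

```python
from itertools import product

# Standard codon table built in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in tens."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence as listed above.
first_line = "ATGACGCGCT ATCTGTTCGG GCGGTTCCTG CAGGGGGCCT TCGTTCTCTG GGCCGCGTTC"
print(translate(first_line))  # prints "MTRYLFGRFLQGAFVLWAAF"
print(round(gc_content(first_line), 1))
```

Pasting the full gene sequence into `translate` and `gc_content` allows the result to be compared with the listed protein sequence and the reported 73% GC content.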