Gene Franean1_4985 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4985 
Symbol 
ID5673324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5980635 
End bp5981822 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content75% 
IMG OID641243839 
Producthypothetical protein 
Protein accessionYP_001509255 
Protein GI158316747 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0151] Phosphoribosylamine-glycine ligase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.076001 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CTGAGCGCGG CGCGCCTCTG GTTCAACCAG ACCTGGCGCG GCACCTACCA GCTCATCGGG 
CTGCTGCGGG ACGGAGCCGG CCCCGGCCGG CTGACGGTGC TCGGTTCCCA CCAGATCCCG
AGCACCCCGT TCCTGCAGGC CTGCGACGCC GTCATCGACG AGCCACCGGG CGAGGGCGAC
GAGTTCGTCG AGCAGGCACT GGCGGCGTGC CGCCGGCACG GGATCGACGT GTTCGTCCCC
GGGCGCAACA TGTTGGACGT CGCCGCCCGG GTCGGCGAGT TCGAGGCCGC CGGAGTGCGG
GTCATGTGCT CCCCAGCCGC CTCGGCGCGG ATCTTCACTA CCAAGTCCGG GCAGTACGCG
GCGATGGCGG CCCGTGGCCT GCCGGTGCCC CACACCCGGA CGGTGACAAC CTTCGCGGAG
TTCGAGGCCG CCTGCGACGA GCTGTCGGCC GCGGGCTGCA CGGTCTGCGT CAAACCGGAC
GTCGACCACG GCGGCCAGGG CTTCCGGATC ATCGACGGGG ACGCCGAGCG CCTGACGGCG
CTGTTCGAGC CGCCGTCGGT GCGGGTCAGC CCCGCCACCA TGGAACGCAT CCTCGGCCGG
GCCGGCAGCT TCCCCGCCCT CGTCGTGGGC GAGTTCCTGG ACGGGCCGGA GTTCAGCGTC
GACGTGCTCT CCCGCCCCGC TCCCGGCACG AGCCCCGCTC CCGGGGCGGG CCCGGTGGCG
ATGCCCGGCA GTGTCCTGGC GGCGGTGCCG CGGGGCAAGG ACGGCCTGCC CTGGACCCGC
AACCTGCGGG CGGACGCGGC CGTGACGGAG CTCGCCACCC GCGTCGTCGA GGAGTTCGGG
TTGGCGTACC TGAACAACGT CCAGGTCCGC TACCGCAGAG CCACGCCGGT GCTGCTCGAG
GTCAACACGC GGGCCGCCTC CGGGACCTAC CAGTCGGCGG CGGCCGGGCT GAACCTGCCC
TGGCTCGCGC TCGCGCTCCT GCTCGGGGAG CCGGTAGAGG TGGGCTCGCC GGACCTCCCG
CAGACGCTCA TCGCCTACAA CGAGGCGATG GTCATGCGCC CGCTCGACCG CCTCTCGCCG
CGCCCGCGTG GTCACGGCGT GGCCCGGGAC GCCGCCCGTC GGCTGGGCTC GGCCGCCGGC
CGGGTGCACC GCCGCAGCAC CGGCCCCGAC CGAGCCCAAC CCGCCTGA
 
Protein sequence
MSAARLWFNQ TWRGTYQLIG LLRDGAGPGR LTVLGSHQIP STPFLQACDA VIDEPPGEGD 
EFVEQALAAC RRHGIDVFVP GRNMLDVAAR VGEFEAAGVR VMCSPAASAR IFTTKSGQYA
AMAARGLPVP HTRTVTTFAE FEAACDELSA AGCTVCVKPD VDHGGQGFRI IDGDAERLTA
LFEPPSVRVS PATMERILGR AGSFPALVVG EFLDGPEFSV DVLSRPAPGT SPAPGAGPVA
MPGSVLAAVP RGKDGLPWTR NLRADAAVTE LATRVVEEFG LAYLNNVQVR YRRATPVLLE
VNTRAASGTY QSAAAGLNLP WLALALLLGE PVEVGSPDLP QTLIAYNEAM VMRPLDRLSP
RPRGHGVARD AARRLGSAAG RVHRRSTGPD RAQPA