Gene Franean1_2157 details


Gene Information

Locus tag: Franean1_2157
Symbol:
ID: 5670557
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2585966
End bp: 2587519
Gene Length: 1554 bp
Protein Length: 517 aa
Translation table: 11
GC content: 64%
IMG OID: 641241078
Product: hypothetical protein
Protein accession: YP_001506499
Protein GI: 158313991
COG category: [S] Function unknown
COG ID: [COG2006] Uncharacterized conserved protein
TIGRFAM ID:
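
The coordinate and length fields in this record are related by simple arithmetic. A minimal sketch of that check follows, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(2585966, 2587519)
print(length)                     # 1554
print(protein_length_aa(length))  # 517
```

Both values agree with the Gene Length and Protein Length fields listed in this record.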


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0332919
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGAAC GCCAGGAAAT AGACCGTGGA GTGCCGTACA ACATCCGAGA CAAAACCCAT 
CAGGTGGCTT TTACCCGCAC CGCGGCCGCG GCCTACCCAT CGAACGCCCC TTTCTCTCCG
GACACGTCCT ATCCGGAGTA CGACCTGGGC CACGTCAACG ACGAACCTAA TCCGGTGTAC
GCTGCCGTGC GCACGGTGCT CCGCCTGGCC GGGCTCGATC CGACGGCGGT CGACACGTCC
TCCTGGAACC CACTCGGCGA TCTGGTCGCC TCAGGTGGCA CCGTGGTGGT CAAACCCAAC
CTCGTACGCG AATCCCATCC CCGGGCTTCC GCAGGATGGA AGTGGGTGCT CACCCACGGC
TCCGTGATTC GGGCAGTAGC CGACTACGCT TTCCTGGCGG TAGGCCGGTC AGGACGGGTC
GTCGTGGCGG ACGCACCCCA GACCGATTCG TCATTTGCCG CGATTTCCAC CGTTCTCGGG
CTCGACCAGT TGAGCCGGTT CTATCTGGAC CGCGGATTGC AGTTCGAGCT CGTCGACCTG
CGCCAGGAGG AATGGACAAC GCGTGGCGAC GTCGTCGTGG CGCGCCACCG ACTCACCGGG
GACCCAGCTG GTGCCGTCGC CTTCGACCTC GGACACTCGA GCGAGTTCGT CGATCACGGA
GGATCGGGCC GCTACTACGG TGCCGACTAC GACTCACGGG TGGTCAATGA ACACCATTCC
GGAGGCCGCC ACGAGTACCT GCTCTCCGGG ACGGTTATGA ACGCAGATCT CATCATCAAC
ATACCCAAGC TTAAAAGCCA CAAGAAAGCC GGGATCACAC TCGGCATGAA GAACCTGGTC
GGTGTGAACG CGGACAAGAA CTGGTTGCCT CATCATACTG AAGGCTGGCC CGGAAACAAC
GGCGACGAGC ATCCTCGAGC CGACACGCGA CACCGCATTG AACGGAAGGC CGTGGCGGGC
CTACGTCGAG CCGCGCTGGC CTGGCCTCGA GTCGGCGGAC ATGTCATGCG TCTCGCCCGG
CAGGGCGGCA CACATGTCTT CGGTGACGGC GACACGGCCA TCCGCAGTGG CAACTGGTGG
GGCAACGACA CGGTCTGGCG AATGTCCCTC GACCTCAACA AGATCGTCAT GTACGGCCGG
GCGGACGGAA CGCTCTCCTC TCAGCCCACC GCACGCCGTC ATGTGGTGCT GGTGGACGGT
GTCATTGCGG GACACCGAAA CGGACCGCTG AACCCCGACG CGATCCCTGG TCGGCTTTTG
GCCTTCGGGC GCACGCCGGC CGCAGTGGAC GCCGCCACCA CCTACCTGTT CGGCTTCGAT
CCGGACCGGA TTCCGACTGT TCGACAGGCT TTCATATGCC GCCACCTCCC CCTGGCAGCA
GGGGACTGGC GCGACATCGA ACTGGTCGGA GACGACGAGA ATTGGTGTGG TCCGCTCAGT
TCCCTGGCCG CGGGGGTGAC CTTGCTCGCC GAGCCGCACT TCGCGTGGAA GGGCCGGGTG
GAACTCGTTC CAGCCCACGA TCATGCGGGA AACCCACGGG TAGGTACGAC ATGA
 
Protein sequence
MTERQEIDRG VPYNIRDKTH QVAFTRTAAA AYPSNAPFSP DTSYPEYDLG HVNDEPNPVY 
AAVRTVLRLA GLDPTAVDTS SWNPLGDLVA SGGTVVVKPN LVRESHPRAS AGWKWVLTHG
SVIRAVADYA FLAVGRSGRV VVADAPQTDS SFAAISTVLG LDQLSRFYLD RGLQFELVDL
RQEEWTTRGD VVVARHRLTG DPAGAVAFDL GHSSEFVDHG GSGRYYGADY DSRVVNEHHS
GGRHEYLLSG TVMNADLIIN IPKLKSHKKA GITLGMKNLV GVNADKNWLP HHTEGWPGNN
GDEHPRADTR HRIERKAVAG LRRAALAWPR VGGHVMRLAR QGGTHVFGDG DTAIRSGNWW
GNDTVWRMSL DLNKIVMYGR ADGTLSSQPT ARRHVVLVDG VIAGHRNGPL NPDAIPGRLL
AFGRTPAAVD AATTYLFGFD PDRIPTVRQA FICRHLPLAA GDWRDIELVG DDENWCGPLS
SLAAGVTLLA EPHFAWKGRV ELVPAHDHAG NPRVGTT