Gene Franean1_3974 details


Gene Information

Locus tag: Franean1_3974
Symbol:
ID: 5672335
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4755739
End bp: 4757163
Gene Length: 1425 bp
Protein Length: 474 aa
Translation table: 11
GC content: 75%
IMG OID: 641242853
Product: hypothetical protein
Protein accession: YP_001508270
Protein GI: 158315762
COG category:
COG ID:
TIGRFAM ID: [TIGR02679] conserved hypothetical protein TIGR02679
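
The start, end, and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. The function name `cds_lengths` is hypothetical, and the sketch assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon (the gene sequence on this page ends in TGA).

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and that the last codon is a stop
    codon, which is not translated into an amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(4755739, 4757163)
print(gene_len, protein_len)  # 1425 474
```

The result matches the Gene Length (1425 bp) and Protein Length (474 aa) fields of this record.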


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.157801
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.261771
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGACGAG CCGTGAGCAC GGAGCCGGTG ATCTACAACA ATGTGACCGG CTCCGACCTG 
ACCGAGGTCG GGACAGCCGT CAGTGGGACG GTCGACATCG AACGGCTCGG CCGGCTGTTC
GGGCCGCCGA CCCAGTGGAT CATTACGCGG GCGCGGTCCC GGCTGGAGCG GGGCGGGTCC
CTCACCGGCG AGCTCACATT GCCGGACCCG TCCGAGCAGC AGCGCGTTGC CCTCGCCGGA
CTTCTGGGAC GCCGCCCTCG GTCGGGCCGT TCCCTGCGGG TCGACGTGGA CGACCTCGAC
ACGGTCGTGC GGCGCAGCGG ATGCGCGCCG AGCCTCGCGG CCGCCGTCGT CGCGCTCACC
GGTCCCATCG TCGACCGAGC CGCGCTCGCC GCCGAGACGG CCCGCGGCTG GCGGGCCGTG
TTCGCCTTGG CCGATCCTCT CGTGGACGCT CGCCCGGTCC TCGGGGCCTG GCGGGACCGG
CTTGGCACCA CCGGTCTGCT GCGCCGGCTC GCCGGTTCGC TGGAGGAGGG GCAGGCGCTG
ATGGCGGCGG CTGTATCCGT GCTGGGCGCG CTGCCGGCCG ACGGCGCGCC GATCTCGGTG
TTCGCCGCCC GTGTGCTGGG CGACGGTCAC GGCCTGGACG CCGATCGTCC CCTGTCCACC
CTGGTGCTCG ACGCCGTCGC ACTGCTCGGC CGGTCGATCA GCGCCGATGC CGACGATGCC
GGCGCTGGTC CGGACGAGCT GGCGGGTCGG ACTGCGCGGT CCACCGAGTG GCGGCGGGAG
GCCTGGGCCG CGGTCGGCGT CCTCGTCAGC GAGCTGGCAC TGCCCGTGCT GACGCTCGGG
TTGCCCGGAG ACCGGCACAG CGTGACTGGC CGCGTCCTGG ACCTGTGGCG GGCGGCCGGT
GAGCCCGTGC ACCTGTCGCT GCGCCAGCTG GTGCGTGACC CACCGATCCT GGACGCGCTG
GCCGGCGTAG CCGTGTTCGT CTGCGAGAAC CCGGCGGTCG TGAGCGCCGC CGCGGACCGG
CTCGAAGCCG GGTGCCGGCC GCTGGTCTGC GTCGGGGGTA TGCCGGCTGC GGCCGCCGCT
TCGCTGCTGC GCCTGCTCGC CGACGCGGGC GCCGTCCTGC GCTACCACGG CGACTTCGAC
TGGGGCGGGC TGGCCATCGC GAACACCGTC ACCACCCGGT TTGGCGCCGT CCCCTGGCGG
TACGACCGCG CCCACTACGA GCGGGCGCTG CGGCCCGGGC TCGCCGGGCT CACGGGACGC
CCCGTCGACG CCCGGTTCGA CCCCGACCTG AGCGACGCCC TTAGCGAGCA CCGCTATCGG
GTTGAGGAGG AGGCCGTCCT CGACGACCTG CTGGCCGACC TCATCTCCGA TCCGCCGGCC
GACCCCGGTA GCGCGCCGAG CGTCGCCAGC AGCGATCCCC ACTGA
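
The GC content field of this record can in principle be recomputed from the gene sequence above. The following is a minimal sketch, not the database's own method: the function name `gc_fraction` is hypothetical, and it assumes the sequence is pasted as plain text in the 10-base blocks shown here, with whitespace ignored.

```python
def gc_fraction(seq_text):
    """Return the fraction of G and C bases in a nucleotide sequence.

    Whitespace (spaces, newlines) between blocks is ignored, so the
    sequence can be pasted in the 10-base block layout used on this page.
    """
    bases = "".join(seq_text.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence only; paste all lines for the full value.
first_line = "GTGGGACGAG CCGTGAGCAC GGAGCCGGTG ATCTACAACA ATGTGACCGG CTCCGACCTG"
print(f"{gc_fraction(first_line):.2%}")
```

Running it on the complete sequence, rather than the single line used here, gives the value to compare with the GC content field.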
 
Protein sequence
MGRAVSTEPV IYNNVTGSDL TEVGTAVSGT VDIERLGRLF GPPTQWIITR ARSRLERGGS 
LTGELTLPDP SEQQRVALAG LLGRRPRSGR SLRVDVDDLD TVVRRSGCAP SLAAAVVALT
GPIVDRAALA AETARGWRAV FALADPLVDA RPVLGAWRDR LGTTGLLRRL AGSLEEGQAL
MAAAVSVLGA LPADGAPISV FAARVLGDGH GLDADRPLST LVLDAVALLG RSISADADDA
GAGPDELAGR TARSTEWRRE AWAAVGVLVS ELALPVLTLG LPGDRHSVTG RVLDLWRAAG
EPVHLSLRQL VRDPPILDAL AGVAVFVCEN PAVVSAAADR LEAGCRPLVC VGGMPAAAAA
SLLRLLADAG AVLRYHGDFD WGGLAIANTV TTRFGAVPWR YDRAHYERAL RPGLAGLTGR
PVDARFDPDL SDALSEHRYR VEEEAVLDDL LADLISDPPA DPGSAPSVAS SDPH