Gene Franean1_7301 details

Gene Information

Locus tag: Franean1_7301
Symbol:
ID: 5675602
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 8922425
End bp: 8923945
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 74%
IMG OID: 641246138
Product: hypothetical protein
Protein accession: YP_001511526
Protein GI: 158319018
COG category: [S] Function unknown
COG ID: [COG1262] Uncharacterized conserved protein
TIGRFAM ID: [TIGR03440] conserved hypothetical protein TIGR03440
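
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 8922425
end_bp = 8923945

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1521 506
```

Both results match the Gene Length (1521 bp) and Protein Length (506 aa) fields.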


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0388032
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGTTCCG GTGCCTTGAC CGGCGACCCG ACCACCACCC GTGACCCTGC CGGCCCGACG 
GCCGCTCCTG CCCTCGACGC GGAGGCGCTG GCGGCGGGGC TGGCGGCGGC CCGGGACCGG
TCGCTCGCCT ACACCGACCT CACCGAGGAC GACCTCCTGC GCCAGCACTC GCCACTAATG
TCACCGCTGG TCTGGGATCT CGCGCACATC GGCAACTACG AGGAGATCTG GCTGCTGCGC
GCCCTGACCG GCGCGGCCGA GCTGCGCCCC GGCCTGGACG ACGTCTACGA CGCCTTCCGG
CACAGCCGGG CCAGCCGGAC CGCGCTGCCG CTGCTCGGCC CGGACGAGGC CCGCGCGTAC
CTGCGGGACG TCCGCGGGCG GGCGATGGAC GTCCTGACCG GGCTGGACCC CGACCTGCTG
CACGCCCCGG AGACGGCCGG CGAGGCCGCC GGACCCGAGG CCGGGAACGG GAACGAGAAC
GGGGCCGCGA GCAGGAACGG GGCCGCGAAC GGGAACGGTG AGCGGGACGG CGGCGCGGCC
GTCCACCGGC GTGCCCGGCT GCTGTCGGGG TCGTTCGTCT ACGGCATGGT GATCCAGCAC
GAGCACCAGC ACGACGAGAC GATGCTCGCG ACGCTGCAGC TGCGCGCGGG CCCGCCCGTC
CTGGCCGACC CGCGGGCGGC GCCGCCATCC CCGGCCGGGC CGGTGGACGA GGCGGCCGAG
GTCCTGGTGC CGGCCGGCGA GTTCACCATG GGCACGTCGA CCGAGCCGTG GGCGTACGAC
AACGAGCGGC CGGCGCACCG GGTGTTCCTG CCCGCCTTCC GCCTCGGCCG CTACCCGGTC
ACCAACCGGG CCCACCTGGC GTTCATCGCG GACGGCGGCT ACGCCGACGA GCGGCTCTGG
TCCGCCGAGG GCTGGGCGTG GCGCTGCCAG GAGGGCCTGA CGGCGCCGCT TTTCTGGTCA
CGCGACGGCG ACGTCTGGAC ACGCCGCCGA TTCGGGCGGG TCGAACCGGT GCCTCTCGAC
GAGCCCGTGC AGCACGTGTG CTGGTATGAA GCCGAGGCCC ACGCCCGCTG GGCCGGACGA
CGGCTGCCCA CGGAGGCCGA GTGGGAGAAG GCGTGTGCGC ACGACCCCGT CACCGGGCGT
TCACGACGGT ATCCATGGGG CGACACCGAC CCGACTCCCG AGCTTGCGAA CCTGGGGCAC
CGCACTGCGC GGCCCGCACC GGTGGGCGGC AGGCCGGCCG GCGCGAGCCC GTACGGCGCC
GAACAGATGA TCGGGGACGT CTGGGAATGG ACCGGCGGCG GGTTCACCCC TTATCCGGGC
TTCGCGTCGT TCCCGTACCG GGAGTACAGC GAGGTCTTCT ATCCTCGCGA TGCGGCCCCG
GCCCGGTTCC GGGTGCTGCG CGGCGGCTCC TGGGCGACCG ATCCCAGCGC CGTCCGCTCC
ACGTTCCGCA ACTGGGACTT TCCGATCCGA CGGCAGATCT TCGCCGGGTT CCGGCTCGCC
CGCGACGCCG ACACCCCCTG A
 
Protein sequence
MRSGALTGDP TTTRDPAGPT AAPALDAEAL AAGLAAARDR SLAYTDLTED DLLRQHSPLM 
SPLVWDLAHI GNYEEIWLLR ALTGAAELRP GLDDVYDAFR HSRASRTALP LLGPDEARAY
LRDVRGRAMD VLTGLDPDLL HAPETAGEAA GPEAGNGNEN GAASRNGAAN GNGERDGGAA
VHRRARLLSG SFVYGMVIQH EHQHDETMLA TLQLRAGPPV LADPRAAPPS PAGPVDEAAE
VLVPAGEFTM GTSTEPWAYD NERPAHRVFL PAFRLGRYPV TNRAHLAFIA DGGYADERLW
SAEGWAWRCQ EGLTAPLFWS RDGDVWTRRR FGRVEPVPLD EPVQHVCWYE AEAHARWAGR
RLPTEAEWEK ACAHDPVTGR SRRYPWGDTD PTPELANLGH RTARPAPVGG RPAGASPYGA
EQMIGDVWEW TGGGFTPYPG FASFPYREYS EVFYPRDAAP ARFRVLRGGS WATDPSAVRS
TFRNWDFPIR RQIFAGFRLA RDADTP
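
The gene sequence begins with GTG while the protein begins with M. This is expected under translation table 11, where GTG can act as an initiator codon and is then read as methionine. The following is a minimal sketch of how the space-separated blocks above could be cleaned, translated, and checked for GC content. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the start-codon set is an assumption based on the commonly cited NCBI definition of table 11. The codon-to-amino-acid mapping itself is the same as in the standard code.

```python
from itertools import product

# Standard codon order (T, C, A, G at each position) paired with amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: initiator codons for table 11; at position 0 they are read as M.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        codon = s[i:i + 3]
        # Only the first codon gets the initiator treatment.
        aa = "M" if i == 0 and codon in START_CODONS else CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence shown above (20 codons).
first_line = "GTGCGTTCCG GTGCCTTGAC CGGCGACCCG ACCACCACCC GTGACCCTGC CGGCCCGACG"
print(translate(first_line))  # MRSGALTGDPTTTRDPAGPT
```

The printed peptide matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.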