Gene Franean1_1397 details

Gene Information

Locus tag: Franean1_1397
Symbol:
ID: 5669804
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1693342
End bp: 1694673
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 67%
IMG OID: 641240322
Product: hypothetical protein
Protein accession: YP_001505749
Protein GI: 158313241
COG category:
COG ID:
TIGRFAM ID:
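
The length fields in this record are consistent with the coordinates. The following is a minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon while the protein length does not. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1693342, 1694673)
print(length)                  # 1332
print(protein_length(length))  # 443
```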


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGTCATG ATCCGTCCGC GTTCGTCATA CCGGCGGACT TCTGGGATCG AACCCAGGTC 
GCCGAGAGCC TGCGGATGCG GGACATCGGT TCCCTGTTCC GGCTGGTGCA GAGGTACGCC
GGTGCAAGTC AGCCGGGGAT CGGGATGCGC GTCGGCCTGG CCCAGTCAGA TATCAGCAAA
TACATCAACG GCAAGCGAAT CGCTACCGAG TTCGAGTTGT TCGAGCGCGT CGCCGACGGG
CTCGACCTGC CCGACCGTGC CCGCATGCTG ATGGGGCTGG CACCGCGCGG CGCCGTCCAG
CTCCCGGAGG CCTCGCGGAC GACCGTCGTG GAACCAACTG CCCTTCCTGA GGAGCCGGAT
TCCGTCGAGG AAATCGGGCA GCGGATCGAG ACGCTCGGTA CATCCAACGT CAGCCCGGCC
GTTCTCGCCC ACTTCGACGT TCTGCTCCTG ACCATGGCGG ACGAGTACGA GTGGGCAGGC
CCGGAAAAAC TCGCGCCACG GATACTCAGG CAACGCCGCC GGGTTCAGAA TTTCCTGGAA
GGACGGCAGC CGCCCCGGCA GCGTGAACGG CTCTACGAAA TCGCCGGTCG GCTCTCCGGA
ATACTCGGCT ACATGGCGGT GAATACCGGC CGGTTCGGAC TCGCACGCGC CTACTGCCTG
GAAGCTCTCC ACACCGCCGA GTTGGTCGGT CACGACGACC TCACAGCATG GATACGCGGC
ACGCAGAGCC TGTGTGAGTA CTACGCCGGG GACTACCGAG CTGCTCTGGA TTTCGCTCGG
GAAGGTCGCC GTGTCGGCGG TCGGTCGGCC CAGGTAATCC GCCTCGCCGT GAACGGGGAG
GCACGCGCCC TCGGTCGACT CGGTGACCGC GCCGGTGTGG ACCGATCCGT GGGCGAAGCG
TTCGACCTCG CCGAACACCA TCCGGTGCCC GGCGGGATGT CGCCCTGCAT CTCCTTCGCG
CCTTACAGCA TCGCCCGTAT CGCCGCGAAC GCCGCGACCG CCTATGTCTC GCTGGGCGAA
CCCGGTCAGG TGCGGGAGTA TGCGGACATG GCCGCCCAGG TGGCAGACCG GTCCCCGTCG
ATGTGGAGCC GTTGCCTCGT CCGCCTCGAC CTCGCGACCG CGCTGCTGCT GTCCGACTCC
CCCGATCCGG AGCAGGCTGC CGTCCTGGGC ATCGAGGCTC TGACCGCCAC GGCCGGCAAC
CCGATCGAGT CCGTCCGACG CCGTAGCCAC GAGCTGGTCG CCCGTGCCAA GCCGTGGCAG
CAGATCGGCC CGGTCGCCGA ACTCGCGGAG GCTTCTCACG CCCTCGCCCT TCCCGCTGGT
GCACATCGGT GA
 
Protein sequence
MRHDPSAFVI PADFWDRTQV AESLRMRDIG SLFRLVQRYA GASQPGIGMR VGLAQSDISK 
YINGKRIATE FELFERVADG LDLPDRARML MGLAPRGAVQ LPEASRTTVV EPTALPEEPD
SVEEIGQRIE TLGTSNVSPA VLAHFDVLLL TMADEYEWAG PEKLAPRILR QRRRVQNFLE
GRQPPRQRER LYEIAGRLSG ILGYMAVNTG RFGLARAYCL EALHTAELVG HDDLTAWIRG
TQSLCEYYAG DYRAALDFAR EGRRVGGRSA QVIRLAVNGE ARALGRLGDR AGVDRSVGEA
FDLAEHHPVP GGMSPCISFA PYSIARIAAN AATAYVSLGE PGQVREYADM AAQVADRSPS
MWSRCLVRLD LATALLLSDS PDPEQAAVLG IEALTATAGN PIESVRRRSH ELVARAKPWQ
QIGPVAELAE ASHALALPAG AHR
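
The protein sequence can be reproduced from the gene sequence with translation table 11. The sketch below assumes that table 11 shares its codon-to-amino-acid assignments with the standard code, that the first codon (here GTG) is an initiator and is therefore read as Met, and that translation ends at the first stop codon. The helper names `clean`, `translate_cds`, and `gc_percent` are hypothetical, and the example input is only the first 30 bases of the gene sequence above.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# assumed to apply to translation table 11 as well).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for 10-base blocks."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading the first codon as Met (assumption)."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    residues = [CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3)]
    if residues:
        residues[0] = "M"  # initiator codon is read as Met
    return "".join(residues).split("*")[0]  # stop at the first stop codon


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


fragment = "GTGCGTCATG ATCCGTCCGC GTTCGTCATA"
print(translate_cds(fragment))  # MRHDPSAFVI
```

Passing the full gene sequence to `translate_cds` should return the protein sequence listed above, and `gc_percent` can be used to check the reported GC content.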