Gene Franean1_5897 details


Gene Information

Locus tag: Franean1_5897
Symbol:
ID: 5674218
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 7161416
End bp: 7163059
Gene Length: 1644 bp
Protein Length: 547 aa
Translation table: 11
GC content: 76%
IMG OID: 641244745
Product: hypothetical protein
Protein accession: YP_001510147
Protein GI: 158317639
COG category:
COG ID:
TIGRFAM ID:
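
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal sketch that assumes the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and do not come from the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(7161416, 7163059)
print(length_bp, protein_length(length_bp))  # prints "1644 547"
```

Both values match the Gene Length and Protein Length fields of this record.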


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0016856
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.41818
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGAGCT CATCGGGCCC GCCGCCCGTC CTGGGCACGC CACCCTCCCA GGCCGGGCCG 
GCGGCGGGAA ACCTACCGCT ACAGCCCGAC CCTGCCCAGC CTGATCGTCA CAATTCGCGA
CCTCATCGTG AGGATCGGTG GCGCGCGGAT CGGCCGTCCG CCGCGCCGCG CCGCTGGGGC
TTCGTGCGGC GCCATGCCGC CTTCCTGATC CTGCTCACGC TGGCCGTCGG GGCCCGGGCC
GCCGTCCTGC TGGCCTATCG GCCCGCGCTC TTCTACTACG GCGACTCGCC CGCCTACCTC
GACCAGGCCA CCAACCGGCT GTGGGCGGGC GACTGGCGGC CGTCGGGCTA CCCGATGTTC
CTGCGGGTCA TCGGCGCCCC CGACCACCTG ACCCGGCTGG TGGTCGTGCA GCACACGGCG
GCCCTGCTGG CCGGGGTCGC CCTCTACGCG GCGTCGCGTC AGCTGTTCGA GCGGCACGGT
CCCGTCGCGG CGGCCGGCCG CACCGGGGGC TGGCCGGCCG CGGTGGTCGC GGCGCCGGCG
CTGCTGGCCC CCTGGGTGCT CGACCTCGGC CAGTTCGTCC TGGCCGACAG CCTGTTCGGG
ACGCTCGTCC TCGGCGGGCT CGTGCTGCTG GCCTGGCCCG GGCGGCCGGC CGCGTGGCGC
TTCGCCGTCG CCGGGCTGCT GCTCGGCGCC AGCCTCACCG TCCGCACGGT CGGCTACGGC
CCGCTCGCGG TGGGCGCGGC CGGTGCCGTC GTCCTGGCTG TCACGCACTG GCGGCGGGCC
CACGCGGCGG TCGTGGCGGC GGGGGCCGTG CTCGCGTTCG TCCTGGGAGC GGCCGTCCCG
GTCGTCGCGT ACTCGGCGTG GAGCGCGGGG CAGGGGAAGG GCTTCACCGT CACCGCGCAC
TCGGGGTTCT TCCTGTACGG CCGGGTCGCC CCGTTCGCCG ACTGCGCCCG CCTGCCCGAC
GATCCCGACC TGCTGTCGCT GTGCGACCCG CGCCCGGTCG GCGAGCACGG CTCCCCGGTG
ACCTACCTCT GGCCGGACGA CTCGCCGCTG CGCCAGGGCA ACGACCTGGT CCCACCGGGC
CGCGAGGAGC TCGCCGGCGA GTTCGCCCGG CACGTGATCC GTGAGCAGCC CTGGACGATG
GTCACCTCGA CCGCCCGCTA CCTGGCCGGG TACTTCTCGC CCGTCCCGTA CGAGAACAGG
CTCACGAGCC GCGCCGACAC CTGGGAGCTG CCCCGGACGG GCACCAACCG CCTCGTCTCG
GACGGCCCGC ACGCCGCCGA CGGGTACTTC TCCGTCGCGC GGCTGAACGA CCCCCCGGTC
GAGCTGCTCG CCTTCTATTC CCGGCTCGGC TACGGGCCGA TGCCCCTGGT CGGCCTGGGT
CTGCTCGCCG GGCTGCTGGC TCAGATCGTC GGGCGGGTGC GCGGGCGCGC CGGCCCCGGT
CGGCTGTTCT GGCTGCTCGG GGGAGCAAGC CTGTCCACCC TGCTCCTGAG CTCCCTGACC
TCGGCGTTCG ATTACCGCTA CCTGGGATCG GTCGTCGGCC TGCTCGCCCC GGCCGCGCTG
CTCGGCGCGG CCGGGCTGGC ACGGGTGCTT CGACCGCGGC CGGTCGGCCG CGGCGGGAGC
GTCAACGAGG GAGGTATCGC GTGA
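
A GC content figure like the one in the Gene Information section can be recomputed from a sequence printed in this layout. The sketch below is one possible way to do it, assuming the sequence is given in space-separated blocks of ten bases; the helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two ten-base blocks of the gene sequence above.
print(gc_percent("ATGGCGAGCT CATCGGGCCC"))
```

Passing the full gene sequence (all lines joined) to gc_percent gives the value for the whole gene.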
 
Protein sequence
MASSSGPPPV LGTPPSQAGP AAGNLPLQPD PAQPDRHNSR PHREDRWRAD RPSAAPRRWG 
FVRRHAAFLI LLTLAVGARA AVLLAYRPAL FYYGDSPAYL DQATNRLWAG DWRPSGYPMF
LRVIGAPDHL TRLVVVQHTA ALLAGVALYA ASRQLFERHG PVAAAGRTGG WPAAVVAAPA
LLAPWVLDLG QFVLADSLFG TLVLGGLVLL AWPGRPAAWR FAVAGLLLGA SLTVRTVGYG
PLAVGAAGAV VLAVTHWRRA HAAVVAAGAV LAFVLGAAVP VVAYSAWSAG QGKGFTVTAH
SGFFLYGRVA PFADCARLPD DPDLLSLCDP RPVGEHGSPV TYLWPDDSPL RQGNDLVPPG
REELAGEFAR HVIREQPWTM VTSTARYLAG YFSPVPYENR LTSRADTWEL PRTGTNRLVS
DGPHAADGYF SVARLNDPPV ELLAFYSRLG YGPMPLVGLG LLAGLLAQIV GRVRGRAGPG
RLFWLLGGAS LSTLLLSSLT SAFDYRYLGS VVGLLAPAAL LGAAGLARVL RPRPVGRGGS
VNEGGIA
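
The protein sequence corresponds to the gene sequence translated with translation table 11. A minimal sketch of that translation is given below. It relies on the fact that NCBI table 11 assigns the same amino acids to codons as the standard code (the two differ in their permitted start codons), and it assumes translation starts at the first base and stops at the first stop codon. The function name is hypothetical.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) paired with the
# amino acid assignments shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA sequence, stopping at the first stop codon."""
    seq = "".join(sequence.split()).upper()
    residues = []
    # A trailing partial codon, if any, is ignored.
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three ten-base blocks of the gene sequence.
print(translate("ATGGCGAGCT CATCGGGCCC GCCGCCCGTC"))  # prints "MASSSGPPPV"
```

The output matches the first ten residues of the protein sequence listed above.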