Gene Franean1_6439 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6439 
Symbol 
ID5674754 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7829184 
End bp7830443 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content76% 
IMG OID641245287 
Productputative RNA-binding protein 
Protein accessionYP_001510682 
Protein GI158318174 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.351487 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0469485 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCGGGG CTCCTTCGGC CGCGCTGCCA CGGCAGCGAC CCGCGGCCGT ACCGGTGGCG 
TTGCCTGGGC TGATTCCCTT CACGCCACCA CCAGCCGCCG CGTCCGCGCG CCGCGCCGGC
GCGGCGCACG CCGGCGCGCC GCCCGGCCTG GGTGACGCGC CGCCCGCCGG GGCGGGGGCC
GTGGCCTGGC TCGCCCGCGT CGGGTCGTGG CCCTCGCACG CCTTCGAGCG GGTGCGCCCG
CGGTGGGCGG GAGCGCGGCT CGGCCTGCTC GCCTACACGG TGGGCCAGGT CATCCTGCTC
GGGCTGCTCT ACGTCGGCTA CAGCCTGAGC CGGCACCTGG CTACCGGCCG CGAGCCCGAC
GCACTCGGGC ACGCTGTTGA CGTCTGGCGG CTGGAGCGGT TCCTGCGCCT GCCCGACGAG
GCCTCGCTGC AGGGCGTCGC GCTGGCGCAC CAGTGGCTGC CGCACGGGGC GAACTGGTAC
TACGTCGGCG TCCACTTCCC GGCGGCTATC CTGCTGCTGG TGTGGGTGTT CGCCCGGCAC
CGCGACCACT GGCGCCGAGT CCGCAACGTG ATCATCCTGG CGACGGGCGC CGGCCTGGGG
ATCCACCTGC TCTACCCGCT CGCGCCGCCG CGTTTCCTCC CACGGGTCGA CTCGTCGGTC
GGCCTGGTCG ACACCGGCAT GCTCTTCGGG CCGTCACCGT ACGGGAAGGG CAGCGGCGGG
GTCGCCAACC AGTACGCCGC GATGCCCAGT CTGCACGTGG GGTGGGCGAT CCTCGAGGCG
TGGGCGGTGG TCACCATCCT GCGCCACCGG ATGCGCTGGC TCGCGGTGAT CCAGCCGCTG
GCGACCGTCG CCGTGGTCGT GCTGACCGCC AACCACTTCT GGCTGGACGG CATCGTGGGC
GGCACCCTGG TCGCGGGCGC GGTGCTGCTC GTCGGCCGCC GTCGCCCGGT GCCGGAGCAG
CCCGCCCTGG TGAGCCTCGG GTTCCGGCCG GCCGTGGCCC CGCGGCCGGC CGCATCCGCG
GGGTCCGCGG AGGTCTCGGC TGTGTCCCCG GGTGTCTCGG CGGTGCCTCC GGCGGTCGTG
CCCTTGGCGG TCGTGCCCGC GGAGCCGGCA GTGCCCACGC AGTCATCAGC GCCCGTGGAG
CCGGCGGTAC CTGCGGAGCC GGCAGTGCCG CCCCAGCACG GGCTCGAGAG CCCCGGAACC
GGGAATGCCC CGGGGGACAG GCCGACTGCG CACGCCGATC CCGTGTCGGC CGTTACGTAG
 
Protein sequence
MVGAPSAALP RQRPAAVPVA LPGLIPFTPP PAAASARRAG AAHAGAPPGL GDAPPAGAGA 
VAWLARVGSW PSHAFERVRP RWAGARLGLL AYTVGQVILL GLLYVGYSLS RHLATGREPD
ALGHAVDVWR LERFLRLPDE ASLQGVALAH QWLPHGANWY YVGVHFPAAI LLLVWVFARH
RDHWRRVRNV IILATGAGLG IHLLYPLAPP RFLPRVDSSV GLVDTGMLFG PSPYGKGSGG
VANQYAAMPS LHVGWAILEA WAVVTILRHR MRWLAVIQPL ATVAVVVLTA NHFWLDGIVG
GTLVAGAVLL VGRRRPVPEQ PALVSLGFRP AVAPRPAASA GSAEVSAVSP GVSAVPPAVV
PLAVVPAEPA VPTQSSAPVE PAVPAEPAVP PQHGLESPGT GNAPGDRPTA HADPVSAVT