Gene Franean1_2480 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2480 
Symbol 
ID5670876 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2957685 
End bp2959325 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content71% 
IMG OID641241397 
Productputative ribonuclease BN 
Protein accessionYP_001506818 
Protein GI158314310 
COG category[C] Energy production and conversion
[S] Function unknown 
COG ID[COG0778] Nitroreductase
[COG1295] Predicted membrane protein 
TIGRFAM ID[TIGR00765] YihY family protein (not ribonuclease BN) 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.422158 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCTCG GCGCTCGGCT GGATCGGGCA CAGCAGAGCA GGCCGAGGAT CGCATTCCCG 
CTCGCGGTCG TCTACAAGTT CGCGGAGGAC CAGGGCGGCT ACCTCGCCGC GCTGATCGCG
TTCTACGGAT TCCTCTCCCT GTTTCCGCTG CTCCTGCTGC TCACGACCGG CCTCGGCTTC
GTCCTCGCGG GGCATCCCGA CATCCAGGAG CAGGTGGTGA GCTCCGCGCT CAGCCAGTTC
CCGATCATCG GTGACCAGCT CCGCAGCGAC GTCCAGGCCC TGCGCGGGAG TGCCGTGGCG
GTGGCGATCG GTGTGCTCGG CAGCATCTGG GGCAGCCTCG GGGTGGCCCG CGCGCTCGGG
AACGCGCTGG ACACGGTCTG GGCGGTGCCA CGGCGCTCGC GGCCGAACCC CTTCTTCGCC
CGGGTGCGCA GCTTCGGCCT CATCGGCTTG TTCGGGCTCG GTGTCGTGCT GACGACCCTG
CTGTCCGCGA TCACCACCCG GGCCGGCGAC CTCGGCACCG GTCTCGGTGC CGGCGCGCAG
GTTCTCGCGG TGGTGCTCGG CATCGCCGGG AACACGGGCC TGATCCTGAT GGCGTTCCGG
CTGCTCACGG TCAAGTCGGT GACGTTCGGC CAGATCCTGC CCGGAGCCGC GATCGCCGCG
CTGGGCTGGC AGCTGCTCCA GTCGGCCGGG ACCTACCTCC TCCAGTACCA GCTACAGGGG
CGCACGCAGG TCTACGGCCT GTTCGCGCTG GTCCTCGGCC TGATGACCTG GCTGTACCTG
CTCGCCGCGG TGATCGTGTT CGCGATGGAG ATCAACACCG TGCGCGCCGA ACGGCTCTAC
CCGCGCGCGC TGCTCACCCC GTTCGTGGAC GACGTCGTCC TCACCGACTC CGACCGGCGG
GTCTACACCT CGTACGCCCA GGCGGAGCAG TTCAAGAGCT TCCAGCAGGT CGACGTCTCC
TTCGACGACG TCTCCGTCGG TGACGCCTCG TCCAGTGACG CCTCCGTCGA CCAGGATCGA
CCCATGGAGC TGACACACGC GATGCGCACG ACGGGCACCT GCCGGCGGTT CCGACCCGAC
CCGGTGCCCG ACGACGTCCT CGTCGCCGCG TTCGACGCCG CCCGGTTCGG CCCGCAGGGC
GGGAACCGCC AGCCGGTGCG GTTCGTGGTC GTCCGGGACC CGGAGCGCCG GCGGGTGCTC
GCCAACCTCT ACCTGGCCCG CTGGCAGCCC TACCTCGACG AGCGCGGGAT CAGTACGCCG
ACCGAGGCCG ACCACTTCGC GCGCACCCTG GCGGACGTCC CCGTGCTGAT CGTGGTGTGC
GCGAAGCTCG CGGCGCTGCA TCCCACCGAC ACCGAGCTCG ACCGGCTGAG CATCGTCGGC
GGGGCGTCGG TCTACCCGAT CGTGCAGAAC CTCTGCCTCG CGCTGCGCGG CGCCGGGGTG
GCCACCGCCC TGACGACGCT GCTGGTCGCC GACGAACCGA AGGTCGCCGA GCTGCTCGAC
ATACCGGACG GCTACGCGAC CGCCGCGCAC CTCGCGGTCG GCTATCCCGA GCGCGGATTC
CCCAGCAACC TGCGGCGCCG TCCCGTCGAG GAGCTCGTCT TCGGGGAGGC CTTCGGACGC
CCGCTGGGCG AGGCTGGATG A
 
Protein sequence
MALGARLDRA QQSRPRIAFP LAVVYKFAED QGGYLAALIA FYGFLSLFPL LLLLTTGLGF 
VLAGHPDIQE QVVSSALSQF PIIGDQLRSD VQALRGSAVA VAIGVLGSIW GSLGVARALG
NALDTVWAVP RRSRPNPFFA RVRSFGLIGL FGLGVVLTTL LSAITTRAGD LGTGLGAGAQ
VLAVVLGIAG NTGLILMAFR LLTVKSVTFG QILPGAAIAA LGWQLLQSAG TYLLQYQLQG
RTQVYGLFAL VLGLMTWLYL LAAVIVFAME INTVRAERLY PRALLTPFVD DVVLTDSDRR
VYTSYAQAEQ FKSFQQVDVS FDDVSVGDAS SSDASVDQDR PMELTHAMRT TGTCRRFRPD
PVPDDVLVAA FDAARFGPQG GNRQPVRFVV VRDPERRRVL ANLYLARWQP YLDERGISTP
TEADHFARTL ADVPVLIVVC AKLAALHPTD TELDRLSIVG GASVYPIVQN LCLALRGAGV
ATALTTLLVA DEPKVAELLD IPDGYATAAH LAVGYPERGF PSNLRRRPVE ELVFGEAFGR
PLGEAG