Gene Franean1_3102 details


Gene Information

Locus tag: Franean1_3102
Symbol:
ID: 5671481
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3660747
End bp: 3662210
Gene Length: 1464 bp
Protein Length: 487 aa
Translation table: 11
GC content: 66%
IMG OID: 641242000
Product: radical SAM domain-containing protein
Protein accession: YP_001507420
Protein GI: 158314912
COG category: [C] Energy production and conversion
COG ID: [COG1032] Fe-S oxidoreductase
TIGRFAM ID:
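
The coordinate and length fields listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Neither assumption is stated on this page, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(3660747, 3662210)
print(length)                  # 1464
print(protein_length(length))  # 487
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.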


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTGTGC TGCTGGTCCA TCCCAGTGCG TTGATGTATT CGGAGATCTT CCTGCGACTC 
GAGCCTCTGG GTCTTGAACG CGTGGCCGCA TCGCTGCTCA TGGCCGGGCA CGAGGTCCGC
CTGATCGATC TTCAGACCGC CGACATACGG GACTACACCC GAGCGTTGGT GGATTTCCGG
CCGCAGACCG TGATGTTCGG CCTGAATTAC CTGGCCAACG TCCCAGAAGT CATCATGCTC
GCGAAACAGG CCAAAATCAC CCGGCCAGGA TGTTTGGTGA TCGCCGGTGG TCACAGCGTC
TCGTTCATCG CCCAGCACCT CCTCGAGAAC TGCGACGAAG CAATCGATGC GGTGGCCCGG
GGAGAGGGCG AGGTCGTCGC TCCCCGGATC CTCGAAGCCG ACTGGGACAG CCTCACCGAG
GTCCCGGGAG CGGTCACCCG CGCCGGGTCC GGGCCCCCGC CGACCATGCT GCCCACCCTC
GACGAGCCGC TCCCGGCCCG TTACCTGCTC GCCCGCCGCA ACCGGTACTT CATCGGCGAG
CTCGACCCGT GTGCATCGGT GGAGTTCACC CGCGGCTGCC CCTGGGACTG CTCGTTCTGC
AGTGCCTGGA CGTTCTACGG GCGCAGCTAC CGGCGGATGT CGGCGGACGC GGCCGGACAC
GAACTCGCCT CCATCCGCGA GCCCAACGTC TTCCTCGTCG ACGACGTGGC CTTCATCAAA
CCCGACCACG GCAATGCCAT CGCCGACCAA ATCGAACGCC GCGGCATCCG AAAGCGCTAC
TACCTGGAGA CCCGCGCGGA CGTCCTGCTG CGCCATCCCG AGGTCTTCCA ACGCTGGCGC
CGGCTCGGCC TGACCTACAT GTTCCTCGGC ATGGAGGCCC TCGACGCCGA GGGACTCGAC
CTGTTCCACA AGCGCATCTC CCCCGACGAG AACATCAAGG CCCTCGAACT CGCCCGCAAG
ATCGGCATCA CCGTGGCGGT GAACCTCATC GCCGACCCCG CCTGGAGCCG TGACCAGTTC
CGGCTGGTCC GACAATGGGC CCTGTCCGTA CCGGAAATCG TCCACCTGAC CGTCATGACG
CCCTACCCGG GCACCGAGAT CTGGCACACC CAGTCCCAGA AACTGACCAC GCTGGACTAC
CGCCTGTTCG ACATCCAGCA CGCGGTCACC CCGACCAGCC TTCCCCTCGA CGAGTTCTAC
CGCGAACTCG TCGCGACCCA GGCCGTACTG AACCGCAAGC ATCTCGGCGT CAAGGCACTG
GCCGCTACCG CACGTATCGT CGCCGGGCAT CTCACCCACG GTCAGACGAA CTTCCTGCGC
ATGCTCTGGA AGTTCCCACG GGTCTACAAC GCCACGCGGC TCCACGCCGA ACACGGCCAG
CCCGCCCGCT ACTGTCTGCC CGCACCGACC CACGCAGGGG TGAGCCGCCG GCGACGCGAG
CTGTACATTC ACCAGCCTAT TTAG
 
Protein sequence
MRVLLVHPSA LMYSEIFLRL EPLGLERVAA SLLMAGHEVR LIDLQTADIR DYTRALVDFR 
PQTVMFGLNY LANVPEVIML AKQAKITRPG CLVIAGGHSV SFIAQHLLEN CDEAIDAVAR
GEGEVVAPRI LEADWDSLTE VPGAVTRAGS GPPPTMLPTL DEPLPARYLL ARRNRYFIGE
LDPCASVEFT RGCPWDCSFC SAWTFYGRSY RRMSADAAGH ELASIREPNV FLVDDVAFIK
PDHGNAIADQ IERRGIRKRY YLETRADVLL RHPEVFQRWR RLGLTYMFLG MEALDAEGLD
LFHKRISPDE NIKALELARK IGITVAVNLI ADPAWSRDQF RLVRQWALSV PEIVHLTVMT
PYPGTEIWHT QSQKLTTLDY RLFDIQHAVT PTSLPLDEFY RELVATQAVL NRKHLGVKAL
AATARIVAGH LTHGQTNFLR MLWKFPRVYN ATRLHAEHGQ PARYCLPAPT HAGVSRRRRE
LYIHQPI
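
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a minimal example, not the method used to produce this record. It assumes the standard codon assignments, which translation table 11 shares with the standard code for elongation. It does not handle the alternative start codons that table 11 allows, which does not matter here because this gene begins with ATG. All names in the code are hypothetical. The helper strips the spaces that separate the 10-base blocks in the listing.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace (block separators, line breaks) and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon: not part of the protein
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a DNA sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 24 bases of the gene sequence above.
first_block = "ATGCGTGTGC TGCTGGTCCA TCCCAGTGCG"
last_block = "CTGTACATTC ACCAGCCTAT TTAG"
print(translate(first_block))  # MRVLLVHPSA
print(translate(last_block))   # LYIHQPI
```

The two printed fragments match the first ten and last seven residues of the protein sequence. Passing the complete gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.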