Gene Franean1_4802 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4802 
Symbol 
ID5673143 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5733475 
End bp5734704 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content72% 
IMG OID641243658 
Producthypothetical protein 
Protein accessionYP_001509074 
Protein GI158316566 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.644094 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAGGC ACAGCACTCG CGTCGCGGTC GTCGGGGCCG CGGTGGCCGT CACCCTCGCC 
GGGGCGCTCG TCGGCACGGC GGCGGCGGAC GGGACCACGG TGGCCGACGT CACCTGGACG
GCCGCCAACT CCGAGGCCAC CGGCGACCAG GACAACGCGG AGGTGTCCGC GACCCGCAAC
GGCTACACCG CCGTGGTCTG GGAGGACGAC CGGGATACCA CGGCCCCCGA GGACACCCTT
CACACCGAGG TGTACCTCCG CCTGTACCGC GACGGGACGT CGCTGTACGA GAAGAAGCTG
TCGGCGGGCG GGAGCGGCAG CTGGCGGCAC GTCCAGCCCG ACGTCGCCCT GCGCGAGGAC
GGCACCGCGG TGGTCATCTG GGCGGAGGAC CCTGACGGCA ACGGGTACTA CAACATCGCC
GTGCGTGCGG TGAACACCGC GGGCACGGTG ACCGGCTCGG CCCAGGCCAA CGCGAACGCC
GACGGGCAGC AGCTCAACGC GCACGTCGCG GCGGACCCGG ACGGCCCCGG GTTCGCGGTC
GCGTTCGAGG ACGTCCAGGG CACCGCGGCA CCGACCGTGC GGGTGTCCGG GTTCGTGTCG
GTGTCGTCCA AGACCTACGA GGTCCAGGTG CACGCGACGG GCGGCACCCA CCGCCGGCCC
GACGTGGCGA CGGACGCCGC GGGCAACGCC GTCGTCGTCT GGGACGAGGA CGGTGACGGC
AACGGGTCGT TCAACATCGG CCGGAAGATC TTCACCTCCT CGGGCGGTGT GAAGGCGGCG
CAGTCCGTCG CCAACGTGAC GACCGCGGGG AACCAGCTCC ACCCGTCGGT GGCCGCGAAC
CTCAACGGCG ACCAGGTCGT CGCGTGGGAG ACCGACCAGA ACGGGAGCGC GCAGGTCGGC
GCCCGTTCCT TCAGCGCGGC GAACGCGGCC GGACCTGAGG TCGTCCTGCC CGGGGCGGAC
CCGCAGAGCG GCATCGACGA CCAGCGCAAC GCGGTGGTGT CCTGGGGCGA GTCCACCGAC
GTCCACGCCC AGGGTCTCAA CCCGGACGGC ACGGTCACCG GCCGACTGCC CCAGCTGCGG
GTCCACACCA CCGTCGCCGG CAAGCAGAAC GAGCCGGCGC TCGGCGTCAA CCCCTGGGGC
CAGATCGTGA TCGCCTACAC CGACGACAAC GACGGCAACG GCTTCGACCA GGTGTATCTG
GGCACCGGCC TGGTCAACAG CACCTGGTGA
 
Protein sequence
MRRHSTRVAV VGAAVAVTLA GALVGTAAAD GTTVADVTWT AANSEATGDQ DNAEVSATRN 
GYTAVVWEDD RDTTAPEDTL HTEVYLRLYR DGTSLYEKKL SAGGSGSWRH VQPDVALRED
GTAVVIWAED PDGNGYYNIA VRAVNTAGTV TGSAQANANA DGQQLNAHVA ADPDGPGFAV
AFEDVQGTAA PTVRVSGFVS VSSKTYEVQV HATGGTHRRP DVATDAAGNA VVVWDEDGDG
NGSFNIGRKI FTSSGGVKAA QSVANVTTAG NQLHPSVAAN LNGDQVVAWE TDQNGSAQVG
ARSFSAANAA GPEVVLPGAD PQSGIDDQRN AVVSWGESTD VHAQGLNPDG TVTGRLPQLR
VHTTVAGKQN EPALGVNPWG QIVIAYTDDN DGNGFDQVYL GTGLVNSTW