Gene Franean1_4270 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4270 
Symbol 
ID5672625 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5103107 
End bp5104543 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content68% 
IMG OID641243143 
Producthypothetical protein 
Protein accessionYP_001508560 
Protein GI158316052 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.680526 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.470302 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCCGACT TCAGCCACCT CAGTTTCGGT GCGCCGGCCG CCGAACGCGA CATCAGGCGC 
GGGCTCGACG CCTACTTCAT CGAGTCGGCC GCGTACCGCA ACATGAACAC CGGAACGAAG
ACGATCCTGG TCGCGAACCG TGGTGCCGGC AAGAGCGCGA TCCTCAAGAC CATGGCCCGC
CGGCACCGCG ACCGCGGGGC CTCGGTCATC GAGCTCGCAC CCGAGGACTA CTCGTACGAA
TTCCTCAGCG GGGCGATGAC CCGCGAGGCC GACGGCGCGT GGGCCAAGCT CGGCGCGTAC
GCCGTGGCCT GGAAATACCT GCTGCTCGTG CTCATCATGA AGGAGCTGAC GAAACGGCAC
GGCCGGATAA GACGCGGCGC CGAGAGCCAG ATATACGCCT ACCTGCGCGA CAACCACCGC
GGCGCGTCGG TCACCCCGCT GGACTCGTTC GTCTCCTATC TGCGCCGCAT CGAGTCGGTG
AAGCTGGGAA GCTTCGAAGC CGGTATCCGC ACGACGGAGC TCCAGCGCCT CTACAAACTC
CAGGAGCTGG AACACCTGAT CCAGCCGTTG AGCCGCCTGT GCGCGCGCTC GGGGGTCACC
GTCCTGGTCG ACGAGCTGGA CCGCGGCTGG GACGCGAGCG AGGACGCGCA GGCCTTCGTC
GCCGGTCTCT TCCAGGCCTG TATGTCGCTG AACGACCTCT CGCCGCACCT GCACGTCTTC
ATGTCGCTGC GCCAGGAGCT CTACGACAAC ATTCCCGCCC TCTACGACGA CGCGCAGAAG
TTCCGCGACC TGATCGAGGT GGTGCGGTGG GACGAGCCGC ACCTGTGGCA GCTGATCGCC
TGCCGTATCC GGCACACCGT GCCCGGGCTG AGCGGTGTGG GCGACGACGA GTGCTGGGCG
GCGGTGTTCC GTGATCCGCG CTGGTCGTTC CGCTACATCG TCGACCGCTC GCTGCGCCGC
CCACGCGAGA TCATCCAGTA CTGCAGCCAC GCGCTCGAAC ACGCGCGTCA GAACCGGTCC
GCCGGTGGCC AGCGGATCGC GCGGCGCGAC ATCCTCGCCG TAGAAGCCAC CTACTCCGGC
GAGCGGACCC GCGACATCGC CGCCGAGTAC CGGTTCCAGC ACCCGGGCCT GCTCAGCGTG
TTCGAGGCGT TCCGCGGCCG CCCGGCGGTC TGGGCCCGGG ACGACCTCGA GTTCCTGCTG
CTGGACATCG CGACGGGAGC CGTGCGGACC AGCCGGGAGG CCACGAAGTG GATCGCCGAC
CGGGATCCCG ACCATCTGCT GGAGACACTG TGGAACGTGG GTTTCATCCG CGACCACTCA
GTGCTGTCCG CATCGTGGGC GGGGAGCGAT CGCTCTGACT CCCTCACCTC GACCAGGGGT
GTCTCGACCT TCGCCGTGCA TCCGATGTTC CGGCAGTACG TGGGAACCCT GGACTAA
 
Protein sequence
MPDFSHLSFG APAAERDIRR GLDAYFIESA AYRNMNTGTK TILVANRGAG KSAILKTMAR 
RHRDRGASVI ELAPEDYSYE FLSGAMTREA DGAWAKLGAY AVAWKYLLLV LIMKELTKRH
GRIRRGAESQ IYAYLRDNHR GASVTPLDSF VSYLRRIESV KLGSFEAGIR TTELQRLYKL
QELEHLIQPL SRLCARSGVT VLVDELDRGW DASEDAQAFV AGLFQACMSL NDLSPHLHVF
MSLRQELYDN IPALYDDAQK FRDLIEVVRW DEPHLWQLIA CRIRHTVPGL SGVGDDECWA
AVFRDPRWSF RYIVDRSLRR PREIIQYCSH ALEHARQNRS AGGQRIARRD ILAVEATYSG
ERTRDIAAEY RFQHPGLLSV FEAFRGRPAV WARDDLEFLL LDIATGAVRT SREATKWIAD
RDPDHLLETL WNVGFIRDHS VLSASWAGSD RSDSLTSTRG VSTFAVHPMF RQYVGTLD