Gene Francci3_1583 details


Gene Information

Locus tag: Francci3_1583
Symbol:
ID: 3903718
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1898056
End bp: 1899954
Gene Length: 1899 bp
Protein Length: 632 aa
Translation table: 11
GC content: 70%
IMG OID: 637878920
Product: hypothetical protein
Protein accession: YP_480688
Protein GI: 86740288
COG category:
COG ID:
TIGRFAM ID:
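
The coordinate and length fields above are mutually consistent, which a few lines of arithmetic can confirm. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
# Values copied from the gene record above.
START_BP = 1898056
END_BP = 1899954
GENE_LENGTH_BP = 1899
PROTEIN_LENGTH_AA = 632


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)
print(computed_gene_length, computed_protein_length)  # 1899 632
```

Under these assumptions, both computed values match the listed Gene Length and Protein Length.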


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGACCGGCA GACCGCTGCT TCCGCCGGAG ATACCGGATC CCCGGCTCGG CCAGAACGCG 
GCTTCCCAGC CCCACGCCGC GGCTGCCCGG GCCCGCCTGC GCTCGGTCCG TCAAAGACGG
CCCGAGGACC CCGGAATCCG GCGGCCGGCG CGGTTGTCAG GCCCCTTCGC GACGGGCCTT
CGCTGGCTGT TTCCGCTGGC GGTCGGCATC ACCCTCTGGG GGCTGTCACT GCGGTGGATC
CACGTCGAGA ACCTCACCGA CTACGGTCTC CCGCCGGCAC TGCCCATCGC CTGGTACGTC
GGCCTGGGCG TGCTCCTGCT CGGCGCTGTC GGCGCCGTCG GGCGGGCCCG CTTCCAGCCG
TTCATCGCGG CGGCCTACCT GCTTGCCTGC ATTCTCGTGC TCTACGGCAC CCTTCCGCTC
ATCCTGGACG TTCCGCACTA CCCCTGGGTG TACAAGCACA TCGGCGTGGT CCGCTATATC
GAGCAGCATG GCGGCGTCGA CATCGACATC GATATCTACC ACCGATGGCC CGGCTTCTTC
GCCGGTACCG CGGTCTTCGG CTCGCTTGCC GGCCGACCCA ACCCGATCGC CTTCGCCGCC
TGGACCGAGG TCTTCTTCAC CGGCATCGAC GCCCTTCTCG TCTGGGCGGC CACCAGGACG
CTGCTTCGCG AGCCACGGAT CGCTGCGGGC GCGGCACTGG TCTTCGTCGT GACGAACTGG
GTCGGGCAGG ACTATTTCGC CCCCCAGGCC CTCAGCCGGA CGCTGGCCCT CGCCCTGTAC
CTGGTGCTCC TGCGGCAACT CCGCACCGGC GCGCCGAGGG CCATCCTGCG GCGGACCGTG
CGGTCCGCCG GAGTCCACAG CAGAAGGCCG CCGCTGCGCT TCGCCGCCCC GCCAGGCCCA
ACCTGGCCGC TGCCGGTGGC GGTCGGCGTC GTGCTGCTCC TGGACGGGGC GGTGGTGGTC
AGCCACCAGT TGACCCCGTA CCTGCTCCTC CTCGGAGTCG GTGCGCTGAC CGCGGCCGGG
CTTGTGCGGC CGCGGTGGGT GGTGGTCGCC ATGGCCGGTC TGACGATCGC CTATCTGATC
CCGCACTTCG CCTATGTTCA GGATCATTTC GGGGTCTTCA GCAGTCCCGA CCCGTTCGCG
AACGCCCGGT CAGAGGAGCC GCCGCTTCCT TCGTCGGGGA AGCTGGCCAA AACCACCGGA
CTGATCCTGA CCGGCACGGT CTGGGCCCTG GCCGGAGTCG GCGGGCTGCG CCGGCTGCAC
CGCGGTGACG GCCGGGTGCT CCCGCTCATC CTGCTGAGCA TCGCACCGGG CGTCATCCTG
TTCGGGCAGA GCTACGGCGG TGAAAGCATC CTGCGGGCGA TTCTGTTCTC CCTGCCCTGG
TGCGCGCCGT TGGTGTGCTG CGCCGTCGCG CCGATGACCG GGCGGTGGCG ACGGCGGCAC
GTCCTGGCGG GCTGCGGCGG GATTCTCCTC CTCGCCACGC TCTTCATGCC CGCCTTCTTC
GGGAAGACGG AACTGAATCT CATGAACGCC GACGAGGTCC AGGCCGGTGA TTATCTGTAC
GCGCACGCCC GGCCCGGGTC GGTGGCCGTG CTGACCGGGG CCACCTTCCC GAGCCGCTAC
GGGGCTCGAT ATCCCGAGGT GGACGATGTC CTGATCACCG ACACCTTCGG CGAGCGCGCA
ATCAGGCCAT CGACGGTGAA TGCCGTGGCC GACTTCATCG GAACGTACCC GGGACGGGGT
TACCTCGTTT TCTCGACGGC ACAGGAGGTG TATGACCGTG TATACCGGAC GGCGACCGCC
GGTGCGCTGC GGGCGGTGGA GACGGCCGTC CGCTCCTCGC CGCGGTTCCG CCTCTGGTAC
GCGACCCGGA ACACCCGTAT CTACGAACTC GCCGGATAG
 
Protein sequence
MTGRPLLPPE IPDPRLGQNA ASQPHAAAAR ARLRSVRQRR PEDPGIRRPA RLSGPFATGL 
RWLFPLAVGI TLWGLSLRWI HVENLTDYGL PPALPIAWYV GLGVLLLGAV GAVGRARFQP
FIAAAYLLAC ILVLYGTLPL ILDVPHYPWV YKHIGVVRYI EQHGGVDIDI DIYHRWPGFF
AGTAVFGSLA GRPNPIAFAA WTEVFFTGID ALLVWAATRT LLREPRIAAG AALVFVVTNW
VGQDYFAPQA LSRTLALALY LVLLRQLRTG APRAILRRTV RSAGVHSRRP PLRFAAPPGP
TWPLPVAVGV VLLLDGAVVV SHQLTPYLLL LGVGALTAAG LVRPRWVVVA MAGLTIAYLI
PHFAYVQDHF GVFSSPDPFA NARSEEPPLP SSGKLAKTTG LILTGTVWAL AGVGGLRRLH
RGDGRVLPLI LLSIAPGVIL FGQSYGGESI LRAILFSLPW CAPLVCCAVA PMTGRWRRRH
VLAGCGGILL LATLFMPAFF GKTELNLMNA DEVQAGDYLY AHARPGSVAV LTGATFPSRY
GARYPEVDDV LITDTFGERA IRPSTVNAVA DFIGTYPGRG YLVFSTAQEV YDRVYRTATA
GALRAVETAV RSSPRFRLWY ATRNTRIYEL AG
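
The gene and protein sequences above are related through translation table 11, in which the TTG at the start of this gene is an accepted initiation codon and is read as M. The following is a minimal sketch of that relationship, not the database's own pipeline: the helper names (`clean`, `translate_cds`, `gc_percent`) are hypothetical, and only the first and last blocks of the gene sequence are translated here as a spot check. The `gc_percent` helper shows how a GC percentage could be computed from the full sequence; the record's 70% figure is not recomputed in this sketch.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Table 11 uses the same
# amino-acid assignments as the standard code; it differs in start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons of translation table 11; as the first codon each is read as M.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3
    protein = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        aa = CODON_TABLE[codon]
        if index == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


first_block = "TTGACCGGCA GACCGCTGCT TCCGCCGGAG"  # start of the gene sequence
last_block = "GCCGGATAG"                          # end of the gene sequence
print(translate_cds(first_block))  # MTGRPLLPPE
print(translate_cds(last_block))   # AG
```

The two printed fragments agree with the first ten and last two residues of the listed protein sequence.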