Gene Francci3_1084 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1084 
Symbol 
ID3906427 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1293411 
End bp1295426 
Gene Length2016 bp 
Protein Length671 aa 
Translation table11 
GC content73% 
IMG OID637878418 
Producthypothetical protein 
Protein accessionYP_480195 
Protein GI86739795 
COG category 
COG ID 
TIGRFAM ID[TIGR02677] conserved hypothetical protein TIGR02677 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0110678 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTGGTCC AGGCCTGCCA GACTCGGCCG TCGCTTCAGC CACCGACGAC CCCGCAGACT 
CCCGTCTTCC CCAGCAGTGT CACTGGTAGC CGGTCACCCG CCGTTCACGC GGTAGCATGG
TGGGCGATAC TGGGCGGTGT GACGAGCGAC GGACAGACCG GGCCGGCAGC CGGCGGTGAT
GCGCCGCCCC GGCAGCCGTT CGCGCACCTG AGCGCCCCCA ACGCGGACGT CTACCGGGAG
GTGCTGACGA CGTTCGCGCG GGCGCGGGAC CGGTTCATCG TCCACATGCG CCCCGAGGAC
GTGATCGCGG ATCTGGGGCG GCCGGGGCAG ACCGAGTCGA TCGTCGCCGC GCTGGACAAG
CTCGTTGAGT GGGGCAACCT GCGGGCCGAC CCGGACACCG GCCGGGTGAC CACGGTGGCG
GACTTCCACC GGGCGCGCTA CCTGTACCAG CTGACCCCGG CCGGGCAGGC GGCCGAGGAG
GCGATCGCCG TCTACGAGGT GGCGATCGGC CGCCGGGGCG CCCTGCAGTC GGTGGCGCTG
GAGGACATCG CCGCCCAGCT GCGCGCCCTG GTGGAGATGA CGGCCGGCGA CGCCGCCGAT
CTCGACCCGG CGAAGGTACA CCTGCTGCTG CTGGGGCTGG CGGAGCGGTT CACCGGACTG
GCCGACAACG CTCAGGCGTT CATGACGTCC CTCCGCCGAG TGATCGACTT CTCGGACGGT
GACGTGGACG CGTTCCTCGC CTACAAGCAA CGGCTCATCG ACTACATCAA CCGGTTCATC
GCCGAGCTGG CGAACCGGGG CGCGGAGATC GCCACCCTGC TGGGTCAGGT CGAGGGGGCC
GGCGTCGAAC GCCTGCTGCT ACTCGCGGCC CGTCGGGAGG CGACCGACGC CGTGCCCGAC
GTGCCCGAGG TATCCGGCTC CGGGGCATCC GGCTCCGGGG CATCCGGCTC CGGGGCATCC
GGCTCCGGGG CATCCGGCTC CGGGGCATCC GATTCCGGCC GGACCGGAGC CGGGCGAACC
GACGCTGAGC GGATTCGGAC CGCCTACGAC GCCGCGGTCG CCGCGGCCCT GGACGGCTGG
CGCAACCGCT GGCGCGGCCT GCACGACTGG TTCGTGTCGG CGGATTCGCG CCGTCCCTCC
CAGGCACGGT TGTTGCGGGG CGCGGCTATC ACCGCGATCA CCCAGCTCAT CGATACCGTC
GCCGCACTCA ACGAGCGCCG TACCGGCCGT TCGGACCGGT CGGCGGACTT CCGCACGCTC
GCTCGCTGGT TTGCCGAGGC TCCTGACGAC GCGGCGGCCC ACCGGCTGTG GCGGGCCGCA
TTCGGCCTGG CCTCGGCCCG GCATCTCACC GTCAGCCAGG AGACCGTCGC CGCCTGGCAG
GAGGACGAGC CGACACCGAA CACGCCGTGG CAGGATGCGC CGCCGATGCG GATCAGCCCG
CAGCTACGCC GGACCGGTTC CTACGAGCGG CGCGGAACGC CGAACCGGGT CACGGACCGC
GCGCGGCAGC GGCGGCTGCT CGCCGAGCAG GCCGCGCGGG AGGCGGAGCA GGCCGCGTCC
GCTCGGGCCC GGCTCGCCAC CGACGGGCCA GTGCTGCTGT CCCAGCTCGG CGTCCTGGAC
CGGCAGGCGT TCCGGCTGTT CCTCGGCCTG CTCGGGGACG CGCTCGCCGC CAGGCTGGCG
GGCGATACCG AGGTGGAGAC CACGTCCAGC GACGGCACGG TCCTGATCCG CCTCACCCTC
GTCCCCGGCG GGGGGGTGGC GCGGATCGAG ACCGAGGATG GCGTCCTGGA CGGCCCGGAA
CACACCGTCG AGATCGTCGA CCTGGCCGGT GCGGCATCGC GTCCGGCGGC CTGGAACGAC
CTGAACGACC TGAACGACTG GAGCGACTCG AACGACAGGA ATGACCGAAA CCGGCAAAAC
GATCTGAATG GCCGGATGGA CGGCCTCGCT CCGGTCGTAC TCGCCGGTGG GCCGGACCGG
CTCGAGCTGC TGGAGACCGG ACGGGTGTCC GGATGA
 
Protein sequence
MLVQACQTRP SLQPPTTPQT PVFPSSVTGS RSPAVHAVAW WAILGGVTSD GQTGPAAGGD 
APPRQPFAHL SAPNADVYRE VLTTFARARD RFIVHMRPED VIADLGRPGQ TESIVAALDK
LVEWGNLRAD PDTGRVTTVA DFHRARYLYQ LTPAGQAAEE AIAVYEVAIG RRGALQSVAL
EDIAAQLRAL VEMTAGDAAD LDPAKVHLLL LGLAERFTGL ADNAQAFMTS LRRVIDFSDG
DVDAFLAYKQ RLIDYINRFI AELANRGAEI ATLLGQVEGA GVERLLLLAA RREATDAVPD
VPEVSGSGAS GSGASGSGAS GSGASGSGAS DSGRTGAGRT DAERIRTAYD AAVAAALDGW
RNRWRGLHDW FVSADSRRPS QARLLRGAAI TAITQLIDTV AALNERRTGR SDRSADFRTL
ARWFAEAPDD AAAHRLWRAA FGLASARHLT VSQETVAAWQ EDEPTPNTPW QDAPPMRISP
QLRRTGSYER RGTPNRVTDR ARQRRLLAEQ AAREAEQAAS ARARLATDGP VLLSQLGVLD
RQAFRLFLGL LGDALAARLA GDTEVETTSS DGTVLIRLTL VPGGGVARIE TEDGVLDGPE
HTVEIVDLAG AASRPAAWND LNDLNDWSDS NDRNDRNRQN DLNGRMDGLA PVVLAGGPDR
LELLETGRVS G