Gene Francci3_0123 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0123 
Symbol 
ID3903453 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp153065 
End bp154531 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content65% 
IMG OID637877456 
Producthypothetical protein 
Protein accessionYP_479246 
Protein GI86738846 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGCCG ACAGCGGCAG GCCCAGCGCG AGCGGCGCAC GCGTCACCGG CGATGACCTC 
CAGTACGCAG TCGCCTGGCA CGCCGCGCTG CGCACCCTTG TGCCACACTC GGGCGCCAAC
GCCGTCACTG TCGAGGCGGT GACGGCCGGC AACGTGGACG ACGTCGTCAT CGGGAAGGCC
CACGGGCCGG ACGACTACAT GCAGGTCAAG GCCAGCGTCA CAGCCGAGAA AGCAGCGACC
ATTGAGTGGC TGACCGCGCT GTCGGGCAAG CGAGGCCCCA GCATTCTCCA GCGGTTCTAC
CGCACCTCGC AGCAGCTGCG GGTCGACGGT GCCCACCCGA GGCTGACCCT GGTCACGAAT
CGGTCCATCC ACCCCGACGA CCCGGTGCTC ACCCTGCGAG ACCGCAATGA TCACTTGGCG
GATCGGCTGT GCACCGCGAC TAATGCAGCT ACAGCGGCCG GACGTCGAAA CCTACTCCGT
CACCTCGACT GCACCGACGA CGAGCTGTAC GAATTCCTGT CCAACCTGCG GCTACACACC
GACGCATCCG AAGCTGCCTG GCGCGACTAT CATATCCGAG ACATAAGCCA CGCGGCGGGT
GTCCAAGCAG ACGAAGTCGC CTACCGGCTC GGAATCGCCG AAGTTCGAGA GTGGGTCAAA
ACCAGCCGCA GCCAGAAACG ACCAGCCGAC ATCGCGGCCG CCATCGACCG CCTCGGCATC
CGGGCGCAAG AGCCATTCAC CATGGTTGCC ATCAACGCCC TCGACGAAGG CTTCACAAAC
CCGGACGCCC GCGTGACGCT CGACTGGGTG GACCGATTCC GAGGAAGCGA GGCCCGCAGC
CGACGCGGTC TCAAGAACCC CAAGGAATGG GAAACAGTTC TTCGGCCACA ACTCATCGAC
GCTCAGCGGA CGTTGCGCAG CCTCGGCGCG AAACGCATCC TCATCACCGG CACCATGCGA
CTGCCGACCT GGTTCACCGC TGCCGTCATG TTCCAGGAGA CAGCCGGATT TATCCCCGCC
AAGACCAAGG ACGGCCAACT GTGGCTCAAA CCCGGCGGAA CGATCATGCC CGCTTCCATC
TGCCTCTCGT CATCGATCGC TGAACTTCGG CCAGGCAGCG AGGTCGCACT CGCGCTCGCG
ATCTCCGCCG ACCTGACCCC AGACGTCACC AGCTACATCG CTAGCACTGG ACGCGATATA
CCCATCATCA CGATCAGCCT GCCCTCCGGA ATCTCGGGTA GTAGCATCGC CGACAGAGAT
CACGCATATG CCGTCGCGTT GGCCGTTCGC GACCTGAGCC GGCAGATCGC GCGCCGTGTC
AATCCGCCAA TACTCCACCT CTTCATGGCG GCGCCATCCG GCTTTGCGGT GCTACTCGGC
GGCGTGTGGG ATCGGGTTCC CGCCACCCAG ACCTACGAAG ACCTCGCCGC GTCCGGCTAC
GAGCCGGCGT TCTTCCTGCC CAACTGA
 
Protein sequence
MAADSGRPSA SGARVTGDDL QYAVAWHAAL RTLVPHSGAN AVTVEAVTAG NVDDVVIGKA 
HGPDDYMQVK ASVTAEKAAT IEWLTALSGK RGPSILQRFY RTSQQLRVDG AHPRLTLVTN
RSIHPDDPVL TLRDRNDHLA DRLCTATNAA TAAGRRNLLR HLDCTDDELY EFLSNLRLHT
DASEAAWRDY HIRDISHAAG VQADEVAYRL GIAEVREWVK TSRSQKRPAD IAAAIDRLGI
RAQEPFTMVA INALDEGFTN PDARVTLDWV DRFRGSEARS RRGLKNPKEW ETVLRPQLID
AQRTLRSLGA KRILITGTMR LPTWFTAAVM FQETAGFIPA KTKDGQLWLK PGGTIMPASI
CLSSSIAELR PGSEVALALA ISADLTPDVT SYIASTGRDI PIITISLPSG ISGSSIADRD
HAYAVALAVR DLSRQIARRV NPPILHLFMA APSGFAVLLG GVWDRVPATQ TYEDLAASGY
EPAFFLPN