Gene Francci3_2940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2940 
Symbol 
ID3903755 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3471601 
End bp3473919 
Gene Length2319 bp 
Protein Length772 aa 
Translation table11 
GC content67% 
IMG OID637880261 
Productprotein of unknown function DUF1524 RloF 
Protein accessionYP_482027 
Protein GI86741627 
COG category[S] Function unknown 
COG ID[COG1479] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0973906 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGGCGA ACGAAACTAC TCTGGGTGAA CTCCTCCAGG GTCAGTGTCA GTACGTCGTC 
CCGTTGTATC AACGCCCCTA CAGCTGGGAA CGCGCCAACC TGCGGCAACT CTGGGCCGAC
ATCACGAGCG TGGCCGCCGC CGGTCCAGCG GCAACACACT TCCTGGGTTC GCTGGTTCTC
GCGCCGAGCC CGTCAACCAC GCCGGCCGGA GTGGCGATCT GGCTGGTGGT AGATGGTCAG
CAACGGCTGA CGGCGCTGAG CATCCTGCTC TGCGCGATCC GCGATCATGT CCGAGATGAC
GATCAGATGC TCGCGGCGAA AATCGACGAT CTGTACCTCA TGAACAGGTA CGCGGCCGGA
ACCGAACGTT ACACCCTGTT ACCGACGCGA GCCGACCGGA CCGCCTGGAC CGCTCTCGTC
GAGCGGTCGC CGGAGGCAGG AGGCCGGGCG GGGATCGGCG ACGCATACCA GTTCTTCCGC
AAGGAACTCG CCGCGTTGCG CGACGCCGAC GACCCGCTGG ACGCAGCACT GATCGAGCAG
GCCGTGGTCG GTCAGCTCGC AATCGTGGAG ATCGCCGCGC ATGTCGACGA CGACGTCTAC
CGCATCTTCG AATCGCTCAA CCACACCGGC CGCCGGCTCA CCCAGGCGGA CCTGCTGCGC
AACTACCTGT TCATGCGGCT ACCCACCCGA GCGGACCGGG TGTACGACTG GCGGTGGTTC
CCACTCCAGG AACTGCTCGG TGACAGACTG GAGGATCTGG TCTGGCTCGA CCTCGTGCTC
CGCGGCGACG ACCGTGCCAC CAAGGAGACC GTGTACCAGT CGCAGCGGCA GTATCTCCAG
ACGCTGCCGG ACGAGGACGC CATCGAGCAG TGGATCTCCG AGCTGCATGC AAAGGCCCTG
CTGTTCAGCC GGATCCTCGA TCCGGACCGG GAGGAGGACC CGGTCCTGCG GCAGGCCCTG
CACCGGCTGC GCCGGTGGGG GGCGGACGTC GTCCGGCCGA TCGTCCTACA CATGCTGATC
GCGCATGCGA ATGACCGTCT CGACGCCACC GAGACCGCCG CGGCACTCCG GGTGGTGGAG
AGCTACCTGG TGCGGCGGAT GCTGGTCGGC ATCGCGAGCG CCAACAGCAA CCGCATCCTC
ATGTCGCTCG TCAAGGAGCT CGGCGACCAG ACACCGACCG CCGCCGCGGT CACGCGGGTT
CTCTCCGGCC CGCGTAAGAA GTTCCCGACC GATCAGCCTG TGAAGGAAGC CGTCCTGCTC
AATCCCTTCT ACTGGACCGG ACGCGGCCCT CAGCGCACCT ATGTGTTGCG CAGCCTGGAA
GAGGATTACG AGCATCTCGA ACCAGTCGAC TGGGGCGTGA AGCTGACGGT TGAACACATC
CTCCCCCAGT CATTGTCGAG ACCGGGATGG AAAGCGGTTC TCGACGCCGA TGCCCACGAG
GGTGAGACGC GCGACGAGCT GCACCGCAGG CTCGTCCATA CTCTGGGCAA TCTGACGCTC
ACCGCCTATA ATCCGAAACT CGCCGATCAC GAGTTCACCG AGAAGAAGAA GCTGCTGGCC
GACAGCGGGC TGGCGATGAA CCGAGAAATC GCCGGTCAGG ACAGATGGGG CCGGGAGGAG
ATCCAGAATC GTGGCCGGGC GCTCGCCGAG CGGATCGTGA AGATCTGGCC CGGCCCGGAC
GAGTCCGTAG TACCGCCGCC CCAGGACCAG CGATGGACCC TGATGAGCCG GGTACTGGCC
GCGATCCCAC CCGGCCGCTG GACCAGTTAC TCCGACGTCG CGGAGGTGAT CGGGTCCCAC
GCCGTCGCGG TGGGAGCCAA GCTGGCCAGT GCCCGGATCT CCAACGCCCA CCGCGTCCTG
CTCCTGAACG GTTCCGTCTC CCCGGACTTC CGCTGGCCCG ATCCGGAACG CACCGACGAT
CCACGGGAGA TCCTGACCGC CGAGGGTGTC ATTCTCAGCA GGTCGGGCCG GGCACTGGCC
CGCCAACGCA TGACCGCCGC CGAACTCGCC GCCGCCGCCG ATCTCGAAAC CCCGGAAGAG
AGCCAGGGCG CCGCCGGAAC CAAGGAACGT CTGTGGGCCG CCCCGGCCGC CCGGTCCGCC
GTGGGGCTCG CGGTCAGCCT GTGGCCATCA GAGCGGCGCG CGGCGGCGCC CACGGACGAT
GCGAACTCCG GCCTCTCCTA CCGGGACCGG GTCCGGGGCT GCCTGCTGGG CGGGGCGTTG
GGAGACGCGC TCGGTGCGGC AATCGAATTC CAGTCGCTTG ACGAGATCCG ACGGGAGTAC
GGGACCAGGG TGACCCGCAA AGTCTGTTTG TGCAGGTAG
 
Protein sequence
MKANETTLGE LLQGQCQYVV PLYQRPYSWE RANLRQLWAD ITSVAAAGPA ATHFLGSLVL 
APSPSTTPAG VAIWLVVDGQ QRLTALSILL CAIRDHVRDD DQMLAAKIDD LYLMNRYAAG
TERYTLLPTR ADRTAWTALV ERSPEAGGRA GIGDAYQFFR KELAALRDAD DPLDAALIEQ
AVVGQLAIVE IAAHVDDDVY RIFESLNHTG RRLTQADLLR NYLFMRLPTR ADRVYDWRWF
PLQELLGDRL EDLVWLDLVL RGDDRATKET VYQSQRQYLQ TLPDEDAIEQ WISELHAKAL
LFSRILDPDR EEDPVLRQAL HRLRRWGADV VRPIVLHMLI AHANDRLDAT ETAAALRVVE
SYLVRRMLVG IASANSNRIL MSLVKELGDQ TPTAAAVTRV LSGPRKKFPT DQPVKEAVLL
NPFYWTGRGP QRTYVLRSLE EDYEHLEPVD WGVKLTVEHI LPQSLSRPGW KAVLDADAHE
GETRDELHRR LVHTLGNLTL TAYNPKLADH EFTEKKKLLA DSGLAMNREI AGQDRWGREE
IQNRGRALAE RIVKIWPGPD ESVVPPPQDQ RWTLMSRVLA AIPPGRWTSY SDVAEVIGSH
AVAVGAKLAS ARISNAHRVL LLNGSVSPDF RWPDPERTDD PREILTAEGV ILSRSGRALA
RQRMTAAELA AAADLETPEE SQGAAGTKER LWAAPAARSA VGLAVSLWPS ERRAAAPTDD
ANSGLSYRDR VRGCLLGGAL GDALGAAIEF QSLDEIRREY GTRVTRKVCL CR